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EMPIRICAL EVALUATION OF SHORT W-B SCALES* 
ARNOLD H. HILDEN and JAMES W. TAYLOR AND PHILIP H. DUBOIS 
VA Regional Office, St.. Louis Washington University 


INTRODUCTION 


Since the appearance of Rab‘n’s article“® on a short form of the Wechsler- 
Bellevue Scale, a continued interest in this subject has been shown.“ © 5 ™, A var- 
iety of short scales have been studied, ranging from two to five subtests, a few ox 
them using only verbal subtests, but generally including both types. Throughout 
this discussion, we use the system of abbreviated notation proposed by Rabin and 
Guertin “7: single letters for the verbal subtests (I; C, A, D, 8, and V for subtests 
Information, Comprehension, Arithmetic, Digit Span, Similarities. and Voeabulary, 
respectively), and two-letter combinations for the performance subtests (PA, PC, 
BD, OA, and DS for subtests Picture Arrangement, Picture Completion, Block De- 
signs, Object Assembly, and Digit Symbol, respectively). W-B is the abbreviation 
for the Wechsler-Bellevue Intelligence Scale, while VS, PS and FS stand for the 
Verbal Scale, the Performance Scale and the Full Scale, respectively. CS(V) refers 
to Hunt’s short scale“, *. *, in which a short form of the Stanford-Binet vocab- 
ulary test replaced the vocabulary subtest of W-B. 


Short Forms Studied. The following combinations of subtests utilized as short forms 
have been reported: pairs: C =, A&,5 84)» T) and PA®, 3%, 7, 8, 14); D and DS“; § 
and BD“); triads: I, C, BD; C, D, 8®; C, A, S@, 3, & ™, 16, 19; Gs S, (V) & % 1: 
GY, DS “4, 6): quartets: I, §, Vv : ‘Ap. I, PA, PC, DS®.™:;C, D, 8S, BD®. 3. ©: C, 
8, PA, BD“; C, V, PC, BD. ™; quintet: C, D, A, 8, PA®), and McNemar’s 
study “*), which reports the best pairs, triads, quartets and quintets. 


Criteria Used. As \a general rule, the criterion for the short forms was FS, either in 
terms of the weighted score or in terms of the IQ, and usually without the inclusion 
of subtest V%. 4: & 1, 4, 16, 19: in two exceptions“! %), V was included. In one 
study, the criterion used was VS IQ; another group of studies“: ', used two 
criteria, the Navy General Classification Test and a five-subtest short form which 
had been validated against FS; and another study’, in addition to using FS ex- 
clusive of V, also used FS minus his short battery. 


Populations. The size of the samples studied varies from 32°? to about 500, § ?” 


With one exception “*), using 355 cases from Wechsler’s standardization group, the 
studies have been based on atypical groups of one sort or another. Among these 
have been court cases in a short-contact clinic “’; institutionalized mental defect- 
ives“. 4); superior individuals®); white male recruits of questionable intellectual 
ability ©); Federal prisoners, unselected, mainly psychotic, physically handicapped 
and otherwise maladjusted patients’, and Federal prisoners, with some special 
psychometric or psychiatric reason for referral ©); Naval recruits, randomly selected, 
without neuropsychiatric diagnosis“: '°; clinic cases referred for various emotional 
problems, but with no psychotics included “'’; closed ward Army hospital patients, 
with severe psychiatric problems, involving the major neuroses and psychoses and 
with several organic brain cases “*); a mixed group, combining male veterans at a 
guidance center, graduate students, Army hospital cases with no psychiatric dis- 
ability, and Rapaport’s“* group of state highway patrolmen“®); female student 
nurses and state hospital patients selected at random from the files “®); and Naval 
recruits suspected of being mentally retarded, none presenting any psychiatric 
problems “?, 


*Reviewed in the Veterans Administration and published with the approval of the Chief Medical 
Director. The statements and conclusions published by the authors are the result of their own study 
and do not necessarily reflect the opinion or policy of the Veterans Administration. From the 
Veterans Administration Regional Office, St. Louis, Missouri. 
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MecNemar’s study “*?, which was not included in the above list, is the only study 
based on a representative group of subjects. However, it is unfortunate that the 
Vocabulary subtest was not included in his study, because of the absence of data on 
that subtest in Wechsler’s manual. This is generally considered one of the most 
important subtests in the scale. 

With regard to other aspects of the population studied, such as race and sex, 
the studies vary considerably, and often suffer in completeness by not reporting 
data on these points. As to sex, male and female are included in some“: *: '®; female 
only in one subgroup “®; male, or presumably male, only in the others. Wechsler 
does not feel that there are statistically significant differences in the total score be- 
tween sexes page 196) With regard to race, of those reporting racial data, some in- 
volve white male subjects only®:- ™, and others include a mixture of whites and 
negroes, while one“? included some Chinese. Where both negroes and white persons 
are involved, some studies report the relative proportion: 7), but most do not refer 
to it. In some instances, ihe author may have avoided the inclusion of negroes and 
made no specif : mention of it, but in view of Wechsler’s opinon™ Pe 107) that 
his “norms cannot be used for the colored population of the United States,” it would 
seem a point important enough to recognize and clarify in a report. 


Age. The ages of the subjects in most reports tend to fall in the late teens or early 
twenties. Five studies show an average of about 23 years, with an SD of 5 to 7 years, 
and with a range of from late teens to close to forty years®: + ° “, ™. In two 
studies “: , the average age was about 17 years, with a narrow SD of only .26 years, 
and with ranges of 17 to 20 and 17 to 25 years. A few studies“. '* '*), report only the 
range, and one study “”? gives no data on age, but the population consists of adults 
(Federal prisoners). McNemar’s study was based on data for age groups 20 to 34 
years. 


Intelligence Level and Distribution. Most of the studies involve populations below 
average intelligence. Two groups“. *), are restricted to mental defectives and border- 
line cases with an average 1Q of about 65. Two other studies have an average IQ 
in the borderline classification“. ™, but the individuals range from mental de- 
fective to bright or superior. Another report is based on a group averaging in the 
dull normal classification “*, ranging from borderline to bright. In another publica- 
tion), the mean IQ 109 is about average, with subjects ranging from borderline 
to very superior. The group of Naval recruits: ' are close to the population at 
large in both mean ability and range; these studies use a short scale which is not 
composed entirely of Wechsler-Bellevue subtests. 

The dispersion of obtained IQ’s may be compared with the theoretical SD of 
14.83 for the distribution of W-B [Q’s. The SD’s obtained on the populations with 
FS range from as low as 7.66, to as high as 20.66. The significance of this factor in 
relation to the size of the correlation coefficient does not appear to have been recog- 
nized adequately, or at least not given explicit recognition in the published reports. 
Only one author makes a definite comment on this point. Having observed that the 
size of the r obtained by him was lower than that obtained by another author using 
the same brief scale, he states that the narrow range of scores is possibly responsible, 
but the SD of his group is only about half a point less than that of the other group. 
Another author, comparing two groups of subjects, notes that the group with wider 
range of intelligence gave results that were ‘even more in agreement,”’ i.e., produced 
a higher correlation between the brief scale and FS. It is difficult to know whether 
this indicates a recognition of restriction of range. The respective SD’s of the two 
groups are not given. 

Inspection of the results of these various studies indicates that differences in 
variability in the samples are not sufficient to account for the obtained differences 
in correlation. Other factors are also operating. There is the question of the degree 
of intercorrelation of the particular subtests used in any given brief scales, as well as 
the correlation of each subtest with the criterion, the FS. The number of subtests 
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ysed is also an important consideration, the more tests the greater the r would tend 
to be, other factors being equal. There is the question of whether relations differ at 
different intelligence and cultural levels, or of differences in functioning of particular 
classes of patients on different subtests and for different levels of difficulty. 


RESULTS OF THE PRESENT STUDY 


The population of our study was drawn from cases seen at a VA Mental Hygiene 
Clinic, most of whom are classified as psycho-neurotics. All patients who had. been 
tested up to June, 1949, were included, with the exception of 12 cases eliminated be- 
cause of race (Negro), age (above 44 or below 20 years), or intelligence level (IQ 
below 67). This resulted in a group that consisted of 161 white male veteran patients, © 
none over 44 or below 20 years of age, and with a minimum IQ of 67. The restriction 
of age range served the purpose of limiting the subjects to World War II veterans, 
and also coincides with ages covered in Wechsler’s standardization tables@ Tables 
wand 40) The cross-validation group consisted of 81 patients tested subsequent 
to June, 1949. 

As can be seen from Table 1, both the experimental and cross-validation groups 
yield statistical values fairly representative of the general population. The mean 
Full Seale IQ and SD are close to those of Wechsler. While the mean and SD of 
the subtests show some deviations, they are quite close to those of Wechsler’ for 
this age range. Since Wechsler presents his subtest means and SD’s in smaller age 
groups (20-24, 25-29, 30-34, 35-39, 40-44) than the present data, his results were 
combined for comparison. The following formulas were used: 


M. = NaMa + NvoMp + NeMe + NaMa +r NeMe 
ee ee eee ae a 4 
N 


[Na(od-+MB) + No(ob-+ME) 4 Na(o2-+ M2) +... (NaMa + NoMy + NeMp +. ? 


= N N? 


Taste 1. ComPpaRIsON OF EXPERIMENTAL AND CROss-VALIDATION GROUPS WITH A SAMPLE OF 
WECcHSLER’s STANDARDIZATION Group (AGpEs 20-44) 








Experimental Cross Valid. Wechsler 
Group Group Age 20-44 

Subtests N=161 N=81 N =530 
M | sp | 





Information 

Comprehension 

Digit Span 

Arithmetic 

Similarities 

Vocabulary 

Picture Arrangement t 
Picture Completion 9.6: 3.34 9.60 
Block Designs 9. 3.17 9.79 
Object Assembly 9.6 2.9% 64 
Digit Symbol 8.: = 8.41 


| WOO CO COON OO COO Ge | 





Verbal Wtd. 44.54 2.7: 44.16 | 15.14 
Performance Wid. | 45.11 9% 46.98 | 13.29 
Full Seale Wtd. 89.65 22.8 91.14 | 25.91 





Verbal IQ 97.61 . 97.06 | 16.87 
Performance IQ 97.53 5.3 Rte) 2.10 OO Liss nsdenlen hers s 
Full Seale IQ 97.52 98.53 17.67 {100.00 14.8: 





Age 29.07 5.84 29.69 
Education 9.21 3. 8.88 
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In these formulas, Mx and ox represent the subtest mean and SD for the total 
range. Na, Ma, and o, represent the number, mean and SD of a subtest for a p 
cular age range; subscripts b, c, d, and e represent the other age ranges. 

The average subtest mean and SD for the experimental group, not inclu 
Vocabulary, is 8.97 and 3.28. In comparison, Wechsler’s group has a mean of 
and SD of 3.13. The age range used by McNemar was 20-34 and this range has an 
average subtest mean of 9.96, and SD of 3.01. Our means are slightly lower and our 
SD’s slightly greater than those found with the standardization group. 

Tables 2 and 3 present the intercorrelations of sub-tests and their correlations 
with the Full Weighted score and the Full Scale IQ. There are two additional rows 


ig 
( 


ma) 


TABLE 2. TABLE OF INTERCORRELATIONS OF SUBTESTS IN EXPERIMENTAL GrouP N = 161 


Subtests , D , . y BD OA DS 


>A 
t- 
BD 
OA 
DS 


F Wtd. 
F IQ 


Corrected r 
F Wtd. 
F IQ 


TaBLeE 3. Tasie or [| ALIDATION Group N 


Subtests D ! F PA PC BD 


"A 
BD 
OA 


. 64 
DS 


.68 
F Wtd 


F 1Q 


.80 
Corrected r 
F Wtd. 


80 
F IQ 


74 


of correlations. These are the corrected correlations of the Full Wtd. and the Ful! 
IQ. As previously noted, earlier studies have utilized a wide variety of subjects 
As a result, means of these populations are frequently far removed from the norma! 
Also the standard deviations have been much smaller in some cases and much large! 
in others. This makes comparison of results almost impossible. A correction can b: 
applied to the standard deviation to estimate results which would have been o! 
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tained had the sampling been more accurate. The following formula was used to 
correct “3, P- 126); 


. 


a) 


“x 
Ixy ~ 
Ox 


Ryy = : - 
\ l-r?y + ry —3 


Ox 


{xy is the corrected correlation, rxy the obtained correlation, 2, the known stand- 
ard deviation of the general population (in Wechsler’s data, 14.8258 for the Full 
Seale [Q, and 22.2929 for the Total Weighted), and ox the obtained standard devia- 
tion. 

This correction serves two purposes with the present data. It allows the results 
of the experimental and cross-validation groups to be compared more adequately, 
and the corrected values can be compared with correlations previously reported. 
While some discrepancies exist between the correlations of the subtests and the 
Total Weighted for the two groups, the differences do not seem to be important. 

In the selection of the best combinations of subtests the Total Weighted score 
was used as the criterion rather than the Full Scale IQ. There are two reasons for 
this choice. When a combination of subtests is used, the score which will be obtained 
is the total weighted or an approximation of this. This approximation will then be 
used to enter the IQ tables for that particular age. It is logical then that the cor- 
relation with the Total Weighted is the most important, and avoids the interfering 
affect of age on IQ as determined by the conversion tables. These are extraneous 
factors to the development of a short form. Actually the correlations of the subtests 
with FS Weighted Score and FS IQ are in close agreement. It may be noted in addi- 
tion that it would be sounder statistically to correlate the subtests with the Total 
Weighted minus that subtest. However, in practice we are not trying to predict 
what the total would be if-the subtest were not included, but rather what the total 
is including this subtest. 

In Tables 4 and 5 the ten best combinations of two and three sub-tests are listed. 
All combinations were calculated using the following formula, in which t stands for 
the Total Weighted (criterion), s for any sub-sum of subtests a+b+ec-+.. .. ee 


TatTa + Mote + Tet%e + TarFa 


ry 


. soe re re ‘ ‘ ‘ “1 7 
Vogtopntoet Cat 2rapTaT ‘2 2racFaFc 2 2raaTaFa T 2rpeF pF ca 2rpatnFa t+ 20 eaTeFa 
TaBLEe 4. OprarnepD CORRELATIONS OF THE TEN Best CoMBINATIONS OF Two TgEsts witn ToTaAL 


WEIGHTED ScorE AND Fuit 1Q, ToGETRER with CORRELATIONS CORRECTED TO STANDARD 
Deviations ReporTep BY WECHSLER 


Total Weighted Full IQ 


Pairs Experimental Cross Valid. Experimental 
of Group Group Group 
Subtests N = 161 N=81 N = 161 
Obt. ‘or. Obt. Cor. Obt. Cor. 


91 .{ 93 91 89 89 
91 4 92 89 87 87 
91 .§ 4 .92 
90 4 : SS SS SS 

88 


.83 .! 88 
86 86 86 


86 86 RD 


86 87 


ARNON GOMD < | 
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Tas_e 5. Osratnep CORRELATIONS OF THE TEN Best CoMBINATIONS OF THREE TEsTs WITH 7 
WEIGHTED Score AND Futy IQ, ToGeTHER wirH CORRELATIONS CORRECTED TO STANDA} 
DEVIATIONS REPORTED BY WECHSLER 


Total Weighted Full IQ 


Triads Experimental Cross Valid. Experimenta 
of Group Group Group 
Subtests N=161 N=81 N=161 
Obt. Cor. Obt. Cor. Obt. C 


0 


> 


BD Of .94 .95 .93 91 9 
BD 94 .93 .94 .93 91 

BD 94 .93 .96 .94 .92 92 
PC .94 .93 .93 91 .93 93 
BD .94 .93 .95 .94 .92 92 
BD 94 .93 .95 .94 .92 .92 
Ds 94 .93 .93 91 .90 89 
BD .94 .93 .94 .93 .92 91 
BD .93 .93 .94 .93 .92 92 


BD .93 .93 .93 .90 .92 92 


~~ 


~ 


eer Re™ 


el Ae ee ae 
peed feed teed bed cot ed A AS ot bere 
Tt dA aoe , 


~* 


All possible combinations of two and three subtests were calculated, and the 
ten best selected for presentation. Those selected were then checked by calculating 
the result for that combination in the cross validation group. The correlation of the 
experimental group with the IQ was also obtained. These three correlations for each 
selected combination were then corrected for standard deviation. 

There are 55 possible combinations of two subtests; the range of correlations is 
from .911 for V and BD to .716 for D and OA. This latter combination has a lower 
correlation than that of most of the individual subtests. In Table 7 the present data 
are compared with combinations previously reported. Since McNemar used Wech- 
sler’s standardization data, a comparison with his results holds more meaning than 
a comparison with a number of the other studies in which the mean and SD are often 
very discrepant. In using all possible correlations for selecting the ten best combina- 
tions of two tests, there are three agreements with the list presented by McNemar. 
If V is eliminated from our combinations, there are two more agreements with 
McNemar. 

There are 165 possible combinations of three subtests. The range of correla- 
tions is from .942 for 8, PA and BD to .827 for D, OA and DS. This latter correla- 
tion is below that of V alone, .845. The overall agreement of our best ten with Me- 
Nemar’s is two; however, if V is eliminated there are four agreements. It is apparent 
that V correlates higher with Total Weighted than any of the other subtests, and also 
enters into the ten combinations very frequently. Since Wechsler does not present 
data on the V test, McNemar’s selection of subtests is necessarily limited. This is 
not only unfortunate for comparative purposes, but also for clinical usefulness, since 
this subtest is one of the most valuable for obtaining qualitative material about a 
patient. 

The correlation of 8, PA, BD with T Wtd. is .942, and that of 8, V, BD is .937 
While the latter is slightly lower, it would be preferable for clinical usage since the 
substitution of V for PA reduces the correlation by a very small amount, but adds 
tremendously to the qualitative diagnostic value. This material may be suggestive 
of distorted thought processes, which are not considered in the intelligence level as 
such, and which could not be obtained from PA. 

To obtain combinations of four tests, reported in Table 6, the four best combin- 
ations of three were used as a core, and the remaining subtests were added to this 
Therefore, the results do not necessarily represent the best possible combinations o! 
four subtests. A complete four test analysis would probably not give significant!) 
different results. It is obvious that with more subtests, a higher correlation will be 
found, since the subtests are a part of the total that is being predicted. There is 
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point of diminishing returns, and this appears to be with the addition of a fourth 
test, the increase being only .02, from .94 to .96. One might also question the pur- 
pose of a short form. If the purpose is to save time and to accomplish the best job 
with the smallest number of tests, three subtests give adequate results. 


TasLe 6. OBTAINED CORRELATIONS OF THE Srx Best CoMBINATIONS OF Four Tests wita Toran 
WercHrTep Score anv Futz IQ, ToGerHer with CoRRELATIONS CORRECTED TO STANDARD 
DeviaTIons REPORTED BY WECHSLER 


Total Weighted Full IQ 
Experimental Cross Valid. Experimental 
Group Group Group 
N=161 N=81 N = 161 
Subtests Obt. Cor. Obt. Cor. Obt. Cor. 


PC, BD .96 .96 96 95 95 95 
PA, BD .96 .96 96 95 .93 .93 
BD, DS 96 .96 95 94 .93 .92 
BD, DS 95 95 .96 94 .92 .92 
PA, BD .95 95 .96 95 94 .94 
PC, DS 95 .95 96 94 4 94 


Table 7 compares previous studies with present results. Several of the reported 
correlations have been corrected to the SD of the standardization data. This was 
not possible in all cases because the obtained SD’s were not reported. It was, of 
course, not necessary to correct any of McNemar’s correlations. There is a variation 
in previously reported correlations from .53 to .97. When all of these combinations 
are calculated on the same set of data, there is very little variation. In the present 
data the correlations of combinations of two tests range from .911 to .716, a differ- 
ence of .195; combinations of three tests have correlations ranging from .942 to .827, 
a difference of .115, The range decreases with the addition of more tests, and one 
might well conclude that it actually does not make too much difference what 
selection is made. 


DIscUSSION 


We have discussed briefly the literature on sbort forms of the Wechsler-Bellevue 
Seale. One of the difficulties encountered in attempting a satisfactory summary of 
these studies has been the great variation in the character of the groups studied, and 
particularly the lack of completeness in what is reported. Some of the obstacles 
found were the use of racial groups to which the norms cannot be validly applied, 
the use of groups whose mean level differs markedly from that of the general popula- 
tion, failure to take into account the variation of the distribution of ability com- 
pared with other groups and with the general population, not taking into account 
the fact that correlation with the full scale is a function not only of the number of 
tests in the brief battery but also of correlations between the individual subtests. 
Some of these latter relationships may be altered as the mean level of the group falls 
at, below, or above the ability level of the population on which the test was standard- 
ized. Some of these problems were not met in McNemar’s study, since he used the 
data obtained on the standardization group itself. He felt that results obtained in 
this way would give results more valid statistically than one based on clinical popula- 
tions. We feel that it is of value to carry out a similar study with some completeness 
on a clinical group shown to have acceptable agreement with regard to statistical 
values pertinent to this type of investigation. A point of further importance is that 
since much use is likely to be made of short forms in clinical settings, it is advisable 
to determine if careful study reveals that findings for a normative group also hold 
generally on a clinical group. In addition, McNemar’s study did not include the 
Vocabulary subtest, one of the most important tests in the battery both from the 
standpoints of predictive accuracy and clinical value. 
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TABLE 7. OBTAINED CORRELATIONS OF COMBINATIONS PREVIOUSLY REPORTED, TOGETHER wr' 
CORRELATIONS OBTAINED FOR THESE COMBINATIONS, ALL CORRELATIONS CORRECTED, W 
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McNemar ) .93 
. Cotzin 
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Patterson .96 
Patterson .96 
McNemar ( 9) .93 
McNemar ( 7) .93 
McNemar ( 5) .93 
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[, 8, BD, DS 
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C, D, 38, BD 


C, A, PC, DS 
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C, 8, PA, BD 
( 
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Db, 8, PC, BD 
A, 8, PC, DS 
A, 8S, BD, DS 
S, PC, BD, DS 


NN NO OD ND Or eS DS DD SD Ww Ww 


“Combinations represented in Tables 4 and 5 as being among the ten best. 
( ) Numbers after references to McNemar indicate the order of his combinations. 





EMPIRICAL EVALUATION OF SHORT W-B SCALES 


SUMMARY 


The present study included an experimental group of 161 cases and a cross valid- 
ation group of 81 cases. These groups gave results which are consistent and quite 
similar to the standardization group of Wechsler in level and range of ability. All 
possible combinations of two and three subtests were calculated and the best ten 
presented. Several combinations of four were also evaluated, but it seems that a 
point of diminishing returns is reached between a battery of thre » and a battery of 
four subtests. 

The combination which seems most fruitful clinically and yields a high relation- 
ship with the Fu‘! Scale is Similarities, Vocabulary, and Block Designs, which cor- 
related .937 with Total Weighted Score. One eombination of three subtests actually 
correlates slightly higher, but the slight gain in prediction would be more than offset 
by the sacrifice in clinical value coming from loss of the Vocabulary subtest. 
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THE USE OF THE KENT EGY FOR THE DETECTION OF MALINGERDD 
MIRIAM E. CROWLEY 


Cushing Veterans Administration Hospital 


Malingering, the feigning of a disability, is important to physicians aud psycho- 
logists in the armed services. It is desirable to find a means of detecting feigned 
feeblemindedness, which we shall henceforth designate as malingering. Hunt | 
Older“? describe some differences in performance on several mental tests betwee: 
feebleminded men and malingering men. They offer no specific scoring stencil 
Pollaczek “ and Goldstein®? describe keys for the detection of malingering. In 
Hunt and Older’s study, the mean scores of the feebleminded group were below those 
of the malingerers. Goldstein’s malingerers had mean scores significantly below thi 
means of the control group, made up of ‘‘genuine failures.”’ Pollaczek found no sig 
nificant differences in the means of her experimental and control groups. In all thes: 
studies each group consisted of about 50 men. 

The goal of the present study was to identify some specific quantitative sig 
on the Kent Emergency Test, that would make possible the detection of persons who 
try to feign mental deficiency. The Kent was chosen because of its simplicity, and 
because answers on it seemed likely to yield differentiating cues. 


( 
A 


i 


SUBJECTS 
Four groups of women were used as subjects (Ss). Letters used to identify each 
group appear in Table 1 together with the method of testing and the number of cases 


TasBLe 1. Sussyecrs ANB METHOD oF TEsTING. 


Group Method 


College ‘‘malingerers”’ Oral, individual 
State School feebleminded Oral, individual 
College “‘malingerers” Written, group 
College “straight” Written, group 
Non-college ‘“‘malingerers” Oral, individual 


D’ Non-college “straight” Oral, individual 


“Parentheses are used to indicate use of same Ss as in preceding group. 


Group A consisted of 59 women, ranging in age from 17 to 22 years, who wer 
selected at random from four classes in a women’s college. Group B consisted of 58 
feebleminded women who were residents in a state school. They were matched 
with the college Ss for age. The mean mental age of the feebleminded Ss, based on 
the 1916 Stanford-Binet, was 9 years 11 months, with a range of 8-0 to 12-0. Only 
women who were not obviously feebleminded in appearance were used as Ss in this 
group. 

To determine how college women normally answer the Kent questions, the test 
was administered as a group test to 176 students in an introductory class in psy 


‘From the Clinical Psychology Division of the Neuropsychiatric Service, Cushing Veterans A 
ministration Hospital, Framingham, Massachusetts. 

'The writer wishes to thank Miss Mathias, psychologist at Belchertown State School, for her hel; 
in securing the Ss for this group, and the management and staff of this institution for their co-operati 
in making the research possible. 
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chology. For lack of a better word, we refer to this procedure as taking the test 
“straight.’’ This group was also asked to feign feeblemindedness on the written test, 
so we obtained two groups from these Ss, namely C’ and C (see Table 1). These Ss 
were similar to group A in age and IQ. Intelligence tests were not administered to 
groups A and C since wé could reasonably assume the IQ’s of the college students 
to be at least at the bright normal level. 

Group D was added in order to include non-college malingerers in the study. 
This group was. made up of 6 nurses and 5 secretaries of average intelligence 
(M = 107; Range 97-113), who were not in college and had not attended college. 
Their ages ranged from 19 to 22. These 11 Ss were selected from 18 female hospital 
employees who were similar in age to groups A, B and C, and whose IQ’s were de- 
termined by means of SRA Verbal Form (AH). Ss with IQ’s beyond the middle or 

‘normal’ range of that test were not included in the test. Thus we excluded Ss who 
scored low enough to be considered feebleminded and those bright enough to be 
“eollege material.’’ This group also took the test twice, as shown in Table 1, and is 
included as groups D and D’. 


PROCEDURE 

All groups were given the original Kent Emergency Test, containing 25 ques- 
tions“), Members of groups A, B, D and D’ were tested individually and gave 
answers orally. Three qualified students’ assisted in testing group A, and the writer 
tested all Ss in groups B, D and D’. Groups C and C’ were tested in groups of about 
30, during a psychology class. The test was mimeographed for these Ss who wrote 
their answers on the question sheet. 

The following instructions were read when the test was given to groups A, C 
and D. 


“During the war, an experiment was conduc ted on Navy men in order to investigate the problem 
of malingering, that is, of pretending to have defects in order to avoid military service. The pur- 
pose was to study the ‘different, wi ays that intelligence test questions were answered by men who 
were imitating malingerers, and men who were really feebleminded. Now, we are trving to study 
the differences between normal women who are pretending to ms linge . and really feebleminded 
women. I am going to give you a brief intelligence test. You are to imagine that women are being 
drafted and that you want to be eliminated from this draft. You have decided to try to get out 
of it by acting feebleminded. Try to answer the questions in the way you believe a mentally de- 
ficient person would. Don’t forget, you are trying to appear feebleminded.”’ 


RESULTS 


Our data showed quantitative and qualitative differences between malingerers 
and truly feebleminded people. The differences noted here are suggested as cues 
for the detection of malingering. 


Quantitative Differences in Mean Scores. The mean score of each of the malingering 
groups, (A, C and D) is lower than that of the feebleminded group (B), see Table 2. 
Differentiating a group of malingerers from a group of truly deficient Ss seems poss- 
ible from the mean scores of groups. Scores of groups A and B, B and C, and B and 
D have a large overlap however, and identifying an individual malingerer by his 
total score may be possible only if it is an extremely low score. 

Our data were the raw scores obtained from the total earned? on all 25 items of 
the original Kent Emergency Test. Thus we have a greater number of answers with 
which to make comparisons than are obtainable from Scale D, the 1941 Revision’, 
which consists of only 10 questions. Age levels computable on Scale D“? were not 
used because so many of the malingerers scored too low, i.e., below 10, and they 
could not be rated on the age scale.‘ 


*The writer is indebted to Miriam Leveton, Nancy Nore and Joann Smith for their kind assist- 
ance with the testing. 

‘All scoring was done by the writer. Lewinski’s‘”) response list was used as an aid. 

‘Only 13 Ss in group A and 12 in C could be assigned a Kent Mental Level. 
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TABLE 2. MEAN Raw Score, STanDARD DEVIATION AND 
RANGE OF Eacn Grovp. 


Group N Mea Sigma Range 


ifferences between the mean scores of groups A and B, B and C, and B and D 


are atistic ally significant. Critical ratios are given in Table 3. The difference 
tween the written and oral mean secres of the college Ss is significant and the 


groups A and C are not pooled and treated as a single group. 


Tas_e 3. Critica, Ratios BErwEEN MEANs or ALL Groups 


The differences between the mean scores of the non-college Ss (D) and 
college Ss (A and C) are not significant either for oral or written malingering 
(CRs .05 and 1.80). It appears that intelligence, at least at the average ar 
above-average levels, has no effect on the feigning of feeblemindedness. 

The means obtained when the test was taken “‘straight’”’ are higher than 
other means, and significantly so. 


Levels of Answering Questions. One might ask whether the low scores obtained by 
our malingering groups were not the result of frequent refusals to answer, or whether 
the two groups actually attempted the same number of answers and simply answere 
differently. We therefore counted, in groups A and B, the number of question 
which an answer was attempted, and the number to which no answer or “I do 
know’ was given. We also noted the number of answers which received full credit 
and the number receiving zero credit. 

The number closest to the median of the combined distribution of groups A a1 
B was used as a “cutting point’ on each count. Frequencies below and above th 
points were used in a two-fold table and the chi square test, suggested by Cronbac!] 
for non-normally distributed data, was used to test the null hypothesis. The medi 
values of each group are shown in Table 4, together with the probabilities that 
X* values obtained in the comparisons are significant. The differences betw: 
groups A and B are not significant for the number of items attempted, nor for t 
number not attempted. Therefore, the two groups gave answers with equal | 
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Taste 4. SIGNIFICANCE OF DIFFERENCE BeTweEen Groups A anv B On Four Ditrrerent Counts 
oF 25 Kent Irems. Basep on X2, pF = 1 
Groups 
A B 
Count Median Median P 
(1) Items attempted ; 24 95 


(2) Items answered, “I don’t 
know” 95 


(3) Items receiving zero credit : ¢ <.001 
< 


(4) Items receiving full credit .001 


quency, and the difference in their mean scores must be attributed to a difference in 
the number of correct answers, or to the difference in the value of the answers given, 
or both. We can reject the null hypothesis on the basis of the P values found for 
counts 3 and 4 (see Table 4). The two groups differ significantly in the number of 
items on which they obtained zero credit and full credit. Group A had more items 
given zero, while group B had more given full credit. 

Item Analysis. Different amounts of credit ranging from zero to four are given on 
the Kent test. A check was made to discover on which items the feebleminded and 
malingerers earned different amounts. The chi square technique was applied to 
determine the significance of the difference in the distribution of credits earned on 
each of the 25 items. On all but five questions (11, 12, 16, 17 and 21), the differences 
in the distribution of credits were considered to be significant.’ Except on Q. 23 and 
Q. 25, which are relatively difficult, the feebleminded Ss usually earned the high 
credits, and the malingerers received the low. 

P values between 2% and:5% are taken to indicate only a tendency on the part 
of the groups to differ in the distribution of credits on each item. Q. 12 and Q. 17 
had P values at this level. 

A more critical discrimination between groups seems possible from this item 
analysis than from mean total scores. One way to recognize the individual malingerer 
is to note the questions on which he has earned very low credits. The more often 
he scores low on those 18 items on which most of our feebleminded Ss scored high, 
the more he may be suspected of malingering. Also if he does well on Q. 23 and Q. 25, 
but has a very low total score, he can be suspected of malingering. 

Examination of credit distribution by group D malingerers revealed a scoring 
pattern very similar to that of group A, but group D is so small that the distribution 
of credits earned by it was not put to a statistical test. On six of the 20 differentiat- 
ing items not a single feebleminded person failed, i.e., received zero credit.* Ss in 
groups A and B were sorted into two categories, those having zero and those having 
more than zero on each of these items. The differences were all significant beyond 
the .001 level’. Group B had only two failures on Q. 7 and three on Q. 9, so on 
these eight questions there were only five failures out of the 464 possible answers. 
On these eight questions, there were 188 failures among group A’s answers. These 
eight items may be most useful when the Kent is employed to identify groups or 
individuals suspected of malingering, because these are items which feebleminded 
people seem unlikely to fail. Other studies®: *. ©) also report this tendency for mal- 
ingerers to fail easy items and to pass hard ones, while the control Ss do the opposite. 
Also, the items are useful because their simplicity makes failures easily recognizable 
as “forced” errors. 


5P values of 2% or less. The .601 level of confidence was found for 14 of the 20 items. 
‘Questions 1, 6, 8, 10, 14, 15. 
"Based on X?, d.f. = 1. 
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Qualitative Differences. Perhaps the most striking difference between the mali: 
groups and the truly feebleminded is seen in the kinds of answers given. Di t 
qualitative differences were apparent in both correct and incorrect answers. Thys 
we have an additional cue for the detection of malingering. We have tried to 
marize certain response characteristics of the malingerers and we give here a 
examples: 

1. The malingerers’ answers were often foolish to the point of being nons: 
cal. The question, ‘If your shadow points to the northeast, where is the sun?” 
answered with such replies as, “In the sky” and “Under the oak tree.”’ The fee! 
minded Ss all named a direction although none got it right. To the item: “Tel! 
the names of some birds,”’ group A Ss often named things which are not birds at 
e.g., ““mosquitoes,”’ ‘‘airplanes,”’ “‘rabbits,’’ and ‘‘cocker spaniel.’’ The Ss in gr 
B named only commonly known birds. 


2. The malingerers were inclined to give “wild” or exaggerated answers. On 
the question, ‘‘How many days in a week?” 26 failed, and gave wrong answers rang 
ing from 2 to 82. Only three in group B failed, and they answered with ‘Five 
ae 


> 


3. The malingerers showed a general knowledge, sophistication and humor 
which the feebleminded person lacks. 

Q. 4 ‘What is sand used for?” Typical of group A answers were “hourglasses” 
“spinach.”” Feebleminded Ss rarely failed this, and those who did gave 
answers like ‘‘For the soil,’”’ and ‘‘To fill holes.” 

Q. 10 “What does ice become when it melts?” Typical malingering responses 
included such implausible answers as ‘‘Rocks” and the somewhat mor 
understandabie “Ginger ale.’ None of the feebleminded failed this 
question. 

“Give me the names of some large cities.’”’ Malingerers named foreign 
cities and countries, Cape Cod, the United States and the Bronx. Thé 
feebleminded erred by naming states and small towns. 


1. The malingerers displayed a vocabulary superior to that of feebleminded 
persons. On the question, ‘‘What metal is attracted by a magnet? practically every 
feebleminded 8 asked the meaning of the word ‘‘magnet’’ while only a few of the 
malingerers did so. Throughout the test, group A members used such words as 
“metropolis,” “‘magnetic,” ‘“electrocute,” ‘‘artificial,’’ “asbestos,’’ ‘‘porous,”’ 
“contradictory.” 


5. The malingerers answered more quickly than the feebleminded on thi 
hard questions and more slowly on the easy ones. While we have no reaction tim¢ 
figures to prove this statement, the subjective experience while testing was clear 
enough to make this a distinct possibility. 


TY SCUSSION 


In view of our results, one may ask whether our malingerers knew the differ 
ence between feeblemindedness and ‘“‘insanity.’’ It would appear that they we 
trying to be “crazy” rather than ‘“‘dumb.” Some explanation at the start might 
have helped their understanding of the difference but that would have been tanta- 
mount to telling them how to act. Our results indicate that not only does the averag 
person not know how to feign‘mental deficiency, but neither does a college stud 
enrolled in psychology. 

Hunt and Older“? reported that the Navy men did not play ‘‘dumb” enoug 
but our women Ss played too “dumb.” Perhaps women performing in an experim« 
for a teacher are not comparable with Navy men participating in an experim: 
under the command of an officer. 

People who are merely feigning malingering may be more self assured and m: 


; 


easy going than those for whom the test results are crucial. Any experiment puts t 
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malingerer in an artificial situation, but the main task in this situation is to feign 
mental deficiency and that is what the true malingerer wants to do. Pollaczek “ 
discusses this point and concludes that “there is little reason to suspect that any 
other personality or intellectual factors are operating in any important fashion to 
distinguish the behavior of the two types of malingerers.’’ It seems to us that per- 
sonality factors provide the best explanation for the test behavior of our malingerers. 
Possible explanations include: (1) the women had the chance to act stupid with- 
out fear of criticism or punishment. (2) They enjoyed playing the role of the “happy 
little moron,’’ who is childlike, amusing and lovable. (3) They were able to pull jokes 
on the examiner even though they were following instructions. (4) They could give 
wrong answers and'still show how clever they actually were. A deeper analysis of 
each of these ‘explanations’ could of course be made to examine the real personality 
factors involved. 

Probably it is easier to think up funny answers when writing than when talking, 
and distance from the examiner may provide the courage needed to give such 
answers. The fact that group C obtained significantly lower scores than group A 
supports this idea. Further support is found in the fact that the no-response total 
was smaller for group C than for group A. 


SUMMARY 
This study located quantitative and qualitative differences between the test 
performance of truly feebleminded women and women who were asked to feign 
mental deficiency. Differences between groups were noted for total raw scores and 
for individual items. The full Kent EGY was used, but the comparisons made here 
should be applicable to many other tests, and are suggested as clues for the de- 
tection of malingering. 
The results of this experiment include: 
1. The mean scores of the malingering groups were significantly lower than 
that of the truly feebleminded group. 
2. It is impossible to identify malingerers from individual total scores unless 
they are extremely low. 
3. - Malingerers were inclined to fail the easy questions and pass some of the 
hard ones. 
4. Malingering and feebleminded groups were found to have significant 
differences in the distribution of scores on 20 out of 25 questions. 
5. Qualitative differences in answers were apparent throughout the test. 


The quantitative and qualitative differences observed here may be useful in 
the detection of malingering by use of the Kent or some other available tests. These 
differences, and possible further exploration, seem to provide a foundation for the 
construction of a test specifically designed for the detection of malingering. 
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RELAXATION AS AN ADJUNCT TO PSYCHOTHERAPY 
MAJA SCHADE, THELMA HRUZA, ANNETTE WASHBURNE, AND MARIE CARNS 


University of Wisconsin 


INTRODUCTION 


This study was undertaken in order to discover (1) if the use of a specific technique 
of relaxation as an adjunct to psychotherapy is effective in the relief of certain soma- 
tic and psychological manifestations and (2) if so, which types of psychiatric dis- 
orders respond best. 

There have been a number of reports on this subject. Hadley,“ Paseal 
Schultz, and Yates“ reported relaxation successful in alleviating signs and symp- 
toms of tension; in promoting insight in psychosomatic relationships, and thus 
assisting in facing and meeting problems. Though their methods varied, these in- 
vestigators observed that the reduction of neuromuscular hypertension supported 
the psychotherapeutic approach to the patient as well as giving him relief. In 
these instances the method was taught by the therapist himself and directly applied 
in the therapeutic session. Bagchi,’ Behrend and Weiss,“ Bouman et al, 
Jacobson,“ and Hosskisson,“? have reported the alleviation of somatic and psy- 
chological symptoms through:relaxation alone. 

When psychosomatic disturbances are caused by deep seated personality dis- 
orders, however, it seems doubtful that lasting effects may be achieved by relaxation 
alone. On the other hand, the authors felt that a specific training in a type of relaxa 
tion which gave the individual a kinesthetic awareness and an intellectual means of 
controlling tension would be a desirable adjunct to the more traditional forms of 
psychotherapy. The difficulties of the patient would thus be approached through 
both somatic and psychological channels. The technique“® used in this study ap- 
peared to have the rationale desired. 


METHOD 


This investigation was carried out as a joint project by members of two depart- 

ments of a University. The instruction in relaxation was given in the department of 
physical education for women. ‘The subjects were referred by members of the staff 
of the neuropsychiatric division of the student health department. No direct us 
of relaxation was made during the psychiatric conferences. 
Subjects. The subjects of this study were eighty-two university students chosen 
from approximately three thousand students who sought aid in the neuropsychiatric 
service of the student health department during a three year period. Forty-five 
were females and thirty-seven were males. The age range was eighteen to forty 
years. Each student was given a detailed psychiatric study by a psychiatrist or a 
clinical psychologist and was considered a suitable case for this investigation be- 
cause of the degree of neuromuscular hypertension present. Every reasonable effort 
was made either to establish or to rule out any organic pathology which might 
directly account for the signs and symptoms. In two cases, one of active duodenal 
ulcer and one of spastic cerebral diplegia, the organic pathology undoubtedly con- 
tributed directly to the somatic manifestations. These two cases are presented in 
some detail in the section entitled ‘Results’. 

Table 1 presents a summary of data considered significant from the medica! 
point of view and for the most part is self-explanatory. It should be pointed out, 
however, that the somatic and psychological symptoms listed are those volunteered 
by the students during medical and psychiatric conferences. Symptoms so listed 
are often more significant, at least in the mind of the patient, than answers to check 
lists or responses to direct questions which might be suggestive. 


Procedure. The method used in this study was designed to provide the subject with 
a technique for inducing within himself a condition of psychosomatic quiescen 
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Through concentration on the release of muscular tension voluntary control over 
tension may be achieved, not only for general reduction of tension during rest, but 
for application in situations where excess tensions interfere with adequate adjust- 
ment. 


TaBLe 1. Summary or Mepicat Data 


Females Males 


Total 


Somatic MANIFESTATIONS 
Excessive Fatigue 3i 2 64 
Insomnia 3: 28 60 
Headache > > 43 
Aches or Pains (Vague) : 40 
Tremor 26 
Dizziness 
Urinary Frequency 
Nausea 
Constipation 
Recurrent Diarrhea 
Precordial Pain 
Emesis 
Sweating 


Corr DEN O 


KoonNno > 


PsycHOLOGIC MANIFESTATIONS 
Confusional Thinking 
Worry 
Retardation 
Weeping 
Depression 
Blocking 
Intellectual Inadequacy 
Social Inadequacy 
Irritability 
Dissociation 
Phobias 


DIAGNOsIs 

Anxiety Tension State 

Anxiety Neurosis 

Psychopathic Personality 

Benign Depression 

Cyclothymia 

Conversion Hysteria 

Hysteria 

Obsessive Compulsive Neurosis 

Paranoid Personality 
(Post-Psychotic) 

Spastic Cerebral Diplegia 


Since concentration implies effort, which is contrary to relaxation, the actual 
muscular release is preceded by the establishment of an attitude of quiet and passiv- 
ity. This attitude is obtained by breathing exercises. The subject is told to observe 
his respiration without interfering volitionally with its rhythm, but rather to “let 
it happen” naturally. Next, the physiologically relatively passive expiratory phase 
is extended by “‘giving in” to it intentionally, thus directing it without effort. The 
result of these practices is a quiet relaxed breathing rhvthm and a mental attitude of 
passivity. 

The release of muscular contraction is achieved by thinking of or imagining the 
body part concerned in its relaxed state, i.e., an arm feels heavy, the shoulder girdle 
wide, etc. Since exhalation appears to facilitate relaxation, “: ». 7 4, P. 26 
ciated with the release of tension by the instruction to “let the arm (for example) 
become heavy” with the exhalation. This strong association of the expiration with 
the muscular release enables the individual to use the relaxed exhalation to initiate 
a reduction of tension on those occasions where excess tensions interfere with ade- 
quate functioning. 


it is asso- 
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The training proceeds from the arms and shoulder girdle to the legs and hips 
and finally to the throat and face. Each step is taken when the preceding on» has 
been accomplished and when some sensation of release has been noticed in the sue- 
ceeding one. The spreading of relaxation from one part of the body to another has 
been observed by Schultz®. »- * and it seems to happen in a fairly regular way 
Waiting for the spontaneous spread seems to reduce concentrative effort which may 
interfere with progress. Moreover, progressively larger portions of the body are re- 
laxed simultaneously, which further reduces concentration on details and over-al] 
general relaxation is reached more quickly. Each practice trial is terminated by a 
deep inhalation and a stretching of the entire body. This aids in the re-establishment 
of the alert state and prevents the persistence of drowsiness. 

A minimum of two, preferably three, daily practice periods is suggested, which 
should at first not exceed ten minutes. After two to three weeks of instruction the 
time limit may be extended. It appears important to hold to short practice periods, 
because tension results from prolonged effort, or concentration fails, and either inter- 
feres with progress. 

The training begins in the supine position and is continued in the sitting posi- 
tion. At this stage the practice of partial relaxation outside of the regular home 
practice is urged. The instructions are to exhale in the relaxed manner and relax 
the shoulder girdle. Sufficient practice enables the individual to utilize relaxation 
during tension producing situations and thus assume a relaxed attitude during 
activity. 

In this study the instruction periods of approximately 40 minutes were held 
twice weekly during the first week. In a few cases of very slow progress these were 
continued for one or two weeks longer. After that the instruction proceeded with 
one session a week. The average time found necessary to acquire the technique was 
eight to ten weeks. However, with subsequent practice, the subject may be expected 
to gain in depth of relaxation as well as in speed with which the relaxed state is 
established. The total time under instruction of the subjects of this study varied 
considerably, because of such factors as interference from the University schedule, 
the large range of disorders and conditions studied, and the individual’s dependenc: 
on the instructor’s help. 


RESULTS 

The data on the results of this study consisted (1) of the critical evaluation of 
specialists, based on detailed case histories and examinations, (2) of records on t! 
course and progress of training in relaxation, and (3) of reports of the subjects or 
benefits obtained from relaxation. No controls were attempted because it was felt 
that individuals with such diverse personalities and psychiatric disorders could not 
be accurately cross matched. 

The general status of the subject was evaluated by the therapist at the last 
psychiatric conference. Sixty-eight (82.9\;) of the group were judged improved; 
ten (12.2%) were unimproved. In four subjects (4.9%) insufficient data prevented 
evaluation. What portion of these improvements is the result of the relaxation train- 
ing and what portion the result of the concomitant psychotherapy cannot be ob- 
jectively determined. In a few cases, where psychotherapy was discontinued during 
the training period, relaxation has clearly played a considerable role. In other in- 
stances improvement of the general status of the subject took place while relaxation 
proficiency remained poor. These cases are reviewed in some detail in the reports 
on diagnostic groups. 

Judgment of relaxation proficiency was passed on a purely subjective basis 
from such physical signs as general flaccidity of the skeletal musculature, quiet re- 
laxed breathing and absence of movement of the eyelids and eyes; also on the 
students’ report on their ability to use the technique in the control of tension. Thos 
who were rated “excellent” (15.9%) were able to relax quickly and deeply. They 
were consistent in obtaining good results from practice as well as from use of thi 
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technique in tension producing situations. Subjects who were rated ‘“‘good”’ (41 

did not relax as rapidly, or were less consistent in reaching their maximum de; 
They were successful in the application of the technique outside of practice peri 
Subjects were rated “fair’’ (28.0%) when relaxation seemed to remain “shallo 
and when success in the use of the technique was only partial or inconsistent. The 
rating “poor” (14.6%) was given when practice resulted in partial or occasional 
success only. 

Table 2 presents a summary of results with the eighty-two subjects divided into 
diagnostic groups. The last columns give an account of alleviations of tension mani 
festations considered by the students as contributing significantly to the improve- 
ment of their general status and to their adjustment to University life. The figures 
are conservative in that they were volunteered to the instructor of relaxation. 


Somatic Signs and Symptoms. Among the somatic symptoms and signs, insomnia 
was the one showing the most rapid improvement. Not infrequently, improvement 
was reported in the first two weeks of training. Only one out of sixty subjects with 
this complaint did not experience improvement. Headaches and abdominal pain 
and discomfort responded well. In some cases they gradually disappeared as the 
subjects became more relaxed generally. In other instances, relaxation was delib- 
erately used to combat these symptoms. 

One of the most common complaints was painful tension in the muscles of the 
neck and shoulders. When these were severe and of long standing it proved helpful 
to loosen the muscles mechanically by passive movement and stretching of the neck 
by the instructor. They disappeared or improved greatly during the training. 

The amount of recuperation from short practice periods of relaxation is sur- 
prising to most subjects. They reported they felt more refreshed than after sleeping, 
particularly sleeping in the afternoon. 


Psychological Signs and Symptoms. Students’ reports on improved mental efficiency 
refer to the aid relaxation has given to study. Concentration improved and the span 
of attention was considerably extended by short practices of the technique while 
studying. In some instances the effect of the practices was, of course, a reduction of 
fatigue. But more often the subjects felt that inability to concentrate and a general 
restlessness prevented successful continuation of work, and these were overcome by 
relaxation. fi 

The subjects who reported they felt generally more relaxed were those who had 
been conscious of and disturbed by a sense of hurry, restlessness and irritability 
In some the improvement came quite automatically as their proficiency in relaxation 
grew. Others deliberately slowed down their pace during activity. They also became 
sensitive to tension and effort exerted in all kinds of activity and deliberately at- 
tempted to preserve their energy. In that way and with the help of relaxed breathing, 
those who had to work with great speed, as when waiting on tables or working in 
kitchens of large restaurants, learned to remain relatively relaxed in spite of the 
pressure under which they worked. 

Perhaps the most significant achievement for a great number of subjects was 
the control of emotionally produced tensions in situations in which adjustment and 
adequate functioning were extremely difficult or impossible. Some practiced relaxa 
tion before the anticipated critical occasion and the after effect carried them through 
without further difficulties. Others used the partial relaxation shortly before or 
during the time of stress. Still others observed in themselves at such times localized 
muscular tensions, as in the shoulders, abdominal muscles or the hands and feet ars 
found that the ability to release them combined with the relaxed exhalation, brought 
about a general lessening of tension. After-reactions, either somatic or psychological, 
ceased to occur or their duration was greatly reduced. Many of the reports of im- 
provement in insomnia, tremor and abortion of headaches and abdominal pain 
belong in this category. 
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Di1aGNostic GROUPS 


The discrepancy in numbers of the different diagnostic groups makes statistical 
evaluation impossible. It appears of interest then to determine the possible sig- 
nificance of such training in relation to the individual’s difficulties and also if any 
particular type of case responded more or less favorably. 


Anzxiety-Tension State. As a group the anxiety: tension states responded well to the 
training in relaxation. Thirty-eight of them were estimated as improved, one as un- 
improved. In one case, insufficient data made estimation of the status impossible. 
The large majority of these subjects possessed a drive for self-efficiency. Willing to 
work for their own progress, they recognized and appreciated relaxation as an active 
control method with which they could rationally and concretely combat their 
difficulties. Good results seem predominantly dependent upon the individual’s 
ability and willingness to persevere in practicing. Twenty nine of the forty subjects 
developed good or excellent proficiency in relaxation. Ten remained fair. Of these, 
four did not finish the training. Four were dependent individuals unwilling to take 
responsibility for themselves. They relaxed quite well under supervision and when 
circumstances were favorable, but never really tried to develop skill. In these the 
results of practice were mainly recuperation and improvement of insomnia. 

Only fair relaxation was achieved by the two subjects who progressed to more 
serious disorders. One showed temporary improvement. He practiced relaxation 
regularly, reducing fatigue and feeling generally more relaxed. But under great 
strain, which developed from certain situational stimuli, his attempts to use the 
technique were only partially successful; though on less critical occasions, as exam- 
inations, it helped him considerably. A few months later he developed a schizophren- 
ic episode and left the University. In this case relaxation could be considered a 
supportive measure. The second developed a schizophrenic episode seven months 
after termination of the relaxation training. Upon reentering the University she re- 
turned voluntarily for further instruction. After two months her relaxation pro- 
ficiency became good. An abominal discomfort could be controlled by relaxation 
practice and she used the technique successfully for the reduction of fatigue and 
general tension. This subject, though unable to relax during the acute phase of her 
illness, received valuable help through it during a remission. 

One case of anxiety tension, judged improved by the therapist, remained 
definitely poor in her ability to relax. She did not practice regularly and made no 
attempt to do so in the face of difficulty. Considering her poor achievement, relaxa- 
tion can hardly have contributed to her general improvement. 

The subject with active duodenal ulcer, referred to previously, was classified 
in this group. He expressed great apprehension over his condition and showed signs 
of marked anxiety. Preparing for his Ph.D. oral examination, he insisted he could 
not follow the recommendation to enter the student infirmary for bed rest. After 
four weeks of intensive psychotherapy he was referred for training in relaxation. He 
grasped the essentials of the technique quickly and made rapid progress. After the 
second month he entered the infirmary for bed rest, relaxation instruction being 
continued there. He improved rapidly and was able to complete his academic work 
on schedule. In his last conference at the end of the academic year he stated that he 
continued to use the relaxation technique and felt that he had been able to ac- 
complish more work with less expenditure of energy and much less tension. Four 
months later, after holding a responsible research position for three months he re- 
ported no return of symptoms. 

In an attempt to evaluate relaxation as an adjunct to psychotherapy four 
subjects should be considered separately. These students did not receive psycho- 
therapy. In three, initial psychotherapy had failed to win a basic rapport. They re- 
mained negativistic and unwilling to face their problems. Through relaxation they 
obtained relief from their tension manifestations to the extent that they were con- 
sidered improved. The fourth case receiving no psychotherapy cancelled her psy- 
chiatric conferences shortly after she began the relaxation training. She developed 
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considerable control over her symptoms and as a result gained in self-confidence 

felt much improved. She was not judged improved, however, by the therapist, si 
her basic anxieties were not considered relieved. 

Anxiety Neurosis. In the group of eleven cases of anxiety neurosis only four follow 
through with both psychotherapy and relaxation. These ‘our possessed a str 
drive for self-efficiency. They developed good skill in relaxation and made inte! 
gent use of it. All improved greatly. Two years after the training they remained 
improved and continued to use the relaxation technique. 

One subject did not continue psychotherapy after alleviation of symptoms 
through relaxation and therefore no data could be obtained by the therapist. Sh 
learned to relax quickly and deeply, practicing regularly and frequently. After two 
and a half months she believed herself on a much better functioning level. Two 
other subjects were estimated by the therapist as improved. Neither completed the 
training. Four subjects failed to improve. They did not foilow through with psycho- 
therapy or relaxation. 

Psychopathic Personality. With one exception the work with relaxation in this 
group proved either unsatisfactory or very tedious and prolonged. The subject with 
excellent relaxation proficiency had instruction for three and a half months and con- 
tinued with occasional follow-up for four months more. A later written communica- 
tion from her told of success and progress in work and personal adjustment. Nothing 
is known of the others. For these relaxation proved to be a supportive measure dur- 
ing the time they were under instruction. They used it for the reduction of fatigue 
and to increase their ability to study longer and more effectively, thus keeping them 
on a better functioning level. 

Benign Depressicn. This diagnosis was applied to those patients who displayed 
evidence of a depressive reaction of a non-cyclic type. Etiologic factors were found 
in situational stimutii. 

Of the three students with benign depression two improved. In one of these 

relaxation could not have contributed, since he remained poor in his ability to relax 
due to lack of practice and interest. In the other very good results were obtained 
She was considered an emergency case because of retardation, dissociation, feelings 
of futility and suicidal ideas. After two months she relaxed well and stated she had 
“built the practice of relaxation into her life’. A follow-up a month later showed 
that she had continued to make use of the technique and stated, ‘‘I simply have no 
reason any more to go into a panic’. Written communications during a three year 
period state that she has had no recurrence of symptoms. The one whom the ther- 
apist did not consider to be improved was a Siamese woman whose main problem 
seemed to be adjustment to life in this country. She learned to relax well and re- 
ported good results ‘from relaxation in recuperation and greater ability to con- 
centrate. Sbe did not return to the Clinic during the remainder of her attendance 
at the University. 
Cyclothymia. This diagnosis was applied to one individual whose mood shifts wer 
cyclic, often seasonal and reactive in nature. In no instance did the degree of “‘ups’’ 
and “‘downs”’ exceed a pattern of mild hypomania or retardation. This young woman 
was a rigidly tense individual. She began the training while a depression was sub- 
siding and though progress was initially slow, she did well ultimately. She had been 
keenly aware of her tenseness and her difficulty in controlling it. The realization 
that she could learn to relax greatly increased her self-confidence and gave her - on- 
siderahle encouragement. 


Conversion Hysteria. This diagnosis was made in one student who demonstrated 
transitory episodes of anxiety or panic reactions. After eight months of psycho- 
therapy this subject had gained good insight into the basic etiology of his difficulties 
He was free from all symptoms except for occasional periods of cramping and weak- 
ness of the legs. At this point he was referred for training in relaxation. After two 
months of instruction he relaxed deeply and quickly. When last seen he said ‘‘T! 

leg symptoms are essentially gone; when any feeling of weakness or spasm occurs, 
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it can be relieved by relaxation’. He has had no recurrence of his original diffi- 
culties over a three year period. 

Hysteria. Only two of these subjects really applied themselves to the learning of 
the technique. Both improved and made good use of it. The three others never 
seemed to understand the procedure or how to practice it. Of these, two remained 
unimproved and the third, though considered improved, relaxed poorly. 


Obsessive-Compulsive Neurosis. The improvement for the group was considerable. 
However, in four students the relaxation training could not have contributed to it, 
sitice two discontinued the instruction and all four relaxed poorly and failed to 
practice regularly. 

Two subjects diagnosed as obsessive-compulsive neurosis with schizoid person- 
ality had considerable difficulty relaxing during periods of anxiety. They always 
practiced during remissions. As a result they felt more relaxed throughout the day 
and less fatigued. In one subject the periods of remission lengthened considerably 
during the two years following instruction in relaxation. Whether or not this is a 
result of his ability to relax must remain a matter of speculation. 

Of special interest is a case of agorophobia in a female student who was unable 
to leave her home unless accompanied by her husband. She had marked suicidal 
tendencies which reached panic levels. She was also disturbed by daily episodes of 
feelings of unreality. She learned to relax excellently and worked with considerable 
tenacity on the control of her difficulties. She claimed she could control the panic 
when, at the first sign of it she could lie down and relax. By controlling the panic 
she felt the suicidal fears were eliminated. In the same way she has eliminated the 
feelings of unreality and has had no recurrence for the last two years. In general she 
felt much better, although her phobia was essentially unchanged, except for a slight 
extension of her radius of movement to a few blocks around her home. 


Paranoid Personality (Post Psychotic). One patient could not be placed under any 
established diagnosis. When first seen by us he was still convalescing from a severe 
schizophrenic psychosis. He had had intensive treatment elsewhere with insulin and 
electric shock therapy. With this student regular relaxation practice under super- 
vision was continued for a considerably longer period than with any other subject. 
Ultimately he relaxed well. For him the main benefit lay in the reduction of fatigue 
and the elimination of insomnia. The practice also reduced his tendency toward 
agitation and irritability. 

Spastic Cerebral Diplegia. This case, referred to previously, had been observed for 
approximately two years in the Neuropsychiatric Clinic at infrequent intervals 
(nine conferences). Neurological examination revealed gross horizontal nystagmus 
with right homonymus hemianopsia, a right peripheral facial palsy, constant gross 
tremor of both hands with choreiform movements, a moderately ataxic gait and a 
speech defect. He was granted a graduate assistantship, which position entailed 
teaching with the necessity for blackboard writing. Under this stimulus the tremor 
and choreiform movements in his arms increased. It was felt that instruction in re- 
laxation might help him. During the first two weeks involuntary movements in the 
left arm and foot were pronounced and quite distracting. With an increase in ability 
to relax involuntary movements disappeared. At the conclusion of the training, 
which lasted two and a half months, he gained considerable skill which he not only 
practiced regularly but which he could also apply at times of strain. The results 
were expressed by the student as foliows: “I feel my arms are under better control; 
in social situations I believe the tremor is hardly noticeable; with conscious effort at 
relaxation I can always control my arms’’. As a consequence of the improvement he 
had gained greatly in self-confidence and poise. 


SUMMARY AND CONCLUSIONS 


In this study an attempt was made: (1) to evaluate a specific technique of re 
laxation as an adjunct to psychotherapy, and (2) to discover whether any particular 
types of cases responded more or less favorably to a training in relaxation. 
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The technique under investigation demonstrated its value to psychoth 
by the following contributions: 

1. Relaxation breaks into signs and symptoms of tension, giving relief 

raising self-confidence. 

2. It demonstrates psychosomatic relationships to the subject. A recognitior 
of these relationships facilitates intellectual insight and provides the possibility 

for rational management of tension manifestations. When medication such as 
sedatives is given, the individual is merely the recipient, while in relaxation 
he is provided with an active control method. 

3. Relaxation may break up habit patterns of hurry and excessive activity 
thereby improving functional balance. 

4. With the necessity for perseverance and self-discipline, the training in re- 
laxation becomes a process of self-education. 

As to the question which type of cases responded best to the relaxation train- 
ing, certain general conclusions can be drawn. It was observed that some individ- 
uals with more serious disorders responded as well, if not better, than some with 
benign disorders. Hence, it is believed that response is dependent primarily upon 
personal qualities. Our observations lead to the following conclusions: 

1. When personal qualities, such as perseverance, aggressive drives, and the 
will for self-efficiency are present, good and excellent results from the relaxa- 
tion training can be expected even under difficult conditions. 

2. The anxiety-tension cases, i.e., disorders of relatively recent occurrence, or 

of reactive rather than affective type appeared to respond most readily. In 
this relatively benign disorder the large majority had a strong desire for self- 
efficiency, and they were willing and able to work for their own progress. 

3. Individuals who lack self-direction, either by reason of their illness or who 
are overly dependent do not respond favorably. For these, however, the train- 
ing may constitute a supportive measure of considerable help. 

t. The reduction of tension manifestations and the ability to maintain an 
adequate functioning level through relaxation may occasionally interfere with 
the individual’s continuing psychotherapy. In reactive disorders the training 
alone may conceivably suffice because of its self-education, its breaking of 
habit patterns and the fact that it is an active control method. This cannot be 
expected for the more severe and deep seated personality disorders. 


Psychotherapy depends to a large extent upon self-direction and the will for 
self-efficiency. These are stressed and encouraged through the training in relaxa- 
tion. Also, psychotherapy is more successful in a relatively relaxed than in an ex- 
cessively tense individual. Hence relaxation may be considered a valuable adjunct 
to psychotherapy. 
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ON THE MATHEMATICAL REPRESENTATION OF THE SYNDROME 
FRANK M. DU MAS 


Louisiana State University 


INTRODUCTION 


The syndrome is a complex, multivariate phenomenon. It may be thought of 
as a more or less unique configuration of symptoms. The configuration of symptoms 
of one syndrome may be different from that of another syndrome in two basic 
aspects: ‘qualitative and quantitative. For example: Syndrome, Si, may have the 
qualitative symptom categories, C, C3, Cs, Cio, Cis; syndrome, Se, may have the 
qualitative symptom categories, Ci, Cy, Cs, Co, Ciz, Cox. Si may be quantitatively 
different from 82 also. For example: 8; and 8: both have the qualitative symptom, 
(Cy, but 8: may have a lower degree of the symptomatic quality than S:. That is, S;, 
may be represented by a lower point on a continuum of magnitudes than S:. Syn- 
dromes, then, may differ in the categories that are exhibited and the magnitude of 
categorical characteristics. 

Patients exhibiting the same syndrome may be differentiated by the gross 
verbal numbers: mild, moderate, severe. For example, two individuals exhibit the 
same syndrome, §;, and therefore the same categories C;, C3, Co, Crp and Cy; but one 
individual has a low degree of these attributes and is diagnosed ‘mild’; the other in- 
dividual has a high degree of these attributes and is diagnosed ‘severe’. The pre- 
ceeding general statements apply to syndromes of any sort. The discussion that 
follows will be restricted to the psychiatric syndrome, but much that will be said 
will apply to other types of syndromes. 

The typical medical or psychiatric description of a syusiestine is usually verbal. 
This verbal description may be long or short, a paragraph or several pages. The 
typical description of a syndrome by a clinical psychologist is also mainly verbal 
but quantitative data in the form of test scores and pictographic representation in 
the form of profiles or psychographs are often included. We shall attempt to de- 
velop a rationale for the extension of quantitative descriptions of syndromes. Our 
attempt should not be regarded as a lack of appreciation for verbal descriptions but 
rather as an attempt to supplement these descriptions by quantitative or mathemati- 
cal ones. 


A More ConvENTIONAL APPROACH TO THE PROBLEM 


For a complete and exact description of a human being in all respects a very 
great number of variables would have to be considered: organic, environmental, 
experiential and behavioral. 


Definitions. 


me : the total number of variables required to describe human beings 
in all respects. 


Ne = mM —Ni : where n; is the number of irrelevant variables and n,; is the num- 
ber of relevant variables required to describe human personality. 


< % : where n is the number of orthogonal variables required for an 
adequate description of human personality. 


By ‘relevant’ we mean that a variable makes a contribution to the prediction 
of a phenomenon. We do not specify operationally how this is done because there 
are many techniques which can be applied to the same data for purposes of pre- 
diction. By ‘irrelevant’ we mean that a variable does not make a contribution to 
the prediction of a phenomenon. It is obvious that n, will be fairly large. Its size 
would seem to obviate any practical application of the concepts presented re. 
Perfection is never attainable but approximations are. The last term defined, n, is 
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necessary if we are to attain an approximation to the best possible represent 
and prediction. This approximation should be adequate to our needs as cli 
psychologists but it is obvious that it will never be perfect. 


The n variables may be regarded as the dimensions of an n-dimensional sp} 
Equation (1) is the general equation for an n-dimensional sphere. 
72 72 r2 2 
Xi+X%,+...+X%, =F, (1) 
where X;, Xo,....Xn, are values of the orthogonal relevant dimensions, r is the rad- 
jus, and the origin is zero. 


Let us suppose that the scores of the general population are normally dis- 
tributed, written in the form of standard or Z scores, and the limits of each of the n 
variables are arbitrarily set at +10 Z. We can then conceive of an n-dimensional 
sphere whose origin is zero (the intersection of the mean Z scores) and whose radius 
is 10 Z. All points for the population should be locatable in this hypersphere. 

The means of the syndromic group will probably be other than the population 
means (Z = ©) on each of the n dimensions. The means of the syndromic group 
define a point (Mx,, Mx,, siya Mx,) in n space; the n scores for an individual define 
a point in n space also. 

Any syndromic group will have a distribution and a standard error of its mean, 
oMx, on each of the n dimensions or n subtests. Some sort of an average standard 
error of the means can be calculated. Since the standard errors of the means on the 
subtests will probably not be equal, the derivation of an average standard error of 
the means will yield a statistic which is at best only approximate. However, some 
notion of the reliability of the syndromic point may be obtained by simply summing 
the variances of the means for all subtests and dividing by the number of subtests 
n, as 


2 2 2 = ‘ 
I = 1/n (omx, + TMxy + Mx,, (2) 


Where o, is the variance for a point and the square root of this value is an average 
standard error of the n means. 


Let the syndromic means define an origin for other hyperspheres and let 105, 
re 2 . ee » © 

2ep, 3ep, .. . . be successive radii of these other hyperspheres. We have then a set 
of encapsulated or concentric hyperspheres and some idea of the reliability of the 
syndromic point is obtained. 

An equation for the syndromic hyperspheres may be written 

(X, - Mx,)” + (X2- Mx,)’ Se (X,, _ Mx,)’ = o*. (3) 

Where, 

Xi, Xe, ... X» are values of the orthogonal relevant dimensions, 

Mx,, Mxg, .. - Mx, are the means of a syndromic population, 

Origin: (Mx,, Mx,, rye Mx,), 

op: the mean standard error of the n means. 

It is obvious that what has been said can be done for each of several syndromes 
It also applies to the profile for an individual provided some estimate can be ma 
of the variance of that individual. If the individual cannot be retested several times 


then one approximation can be made by using the standard error of measurem: 
for each of the subtests and appropriate substitution in formulas (2) and (3). 
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A New APPROACH TO THE PROBLEM 


In clinical psychology, test profiles have been widely used as diagnostic aids. 
When evaluating a set of magnitudes the psychologist has, in the past, often used 
two methods: subjective evaluation of the test scores and pictographic representa- 
tion. When test scores are graphically delineated we call the graph a profile. Ex- 
amples of this pictographic representation are the MMPI, Wechsler and Kuder 
profiles. 

The profile is composed of a set of subtests or dimensions. An individual's 
scores on the subtests are plotted and a straight line drawn to connect adjacent 
score points. Figure 1 is an example of such a profile. Psychologists like profiles. Is 
it possible to retain the profile but at the same time give it more rational significance? 
The reader should look at Figure 1 again. It will be noted that: 

a. There is no unique sequence or order for the subtests. 

b. The distances between subtest ordinates are arbitrarily equal. 

ce. There is no single continuum for the abscissa. 

d. The profile segments are not rational continua. 
Let us reason together in the hope that we can attain a conception of the profile 
which permits the four statements above to be written affirmatively. 

TaBLe 1. Hyporsetican Data. CompLeTe EXPLANATION IN TEXT. 


Subtests 
A .00 .30 
B .30 .90 
Cc .00 .10 
D 3.30 | 
E 2.30 50 
F 2.00 .65 
G .60 a0 


Z score rsp 


Consider table 1 as data for a syndromic group. Figure 1 is a profile constructed 
from data in the first two columns. Assume that we have a perfect test, p, of the 
syndrome; that is, p has perfect reliability and validity. Assume that we have a 
set of subtests, s denoting a subtest, each of which is correlated with p as shown in 
table 1. Assume further that the distributions of p and s are normal, scores are 
standard or Z scores, each test is composed of a very large number of items, and that 
the correlation between s and 7, 7sp, is linear and positive. Equation (4) is the fam- 
iliar regression equation. 


sd oy 
Y’ = rxy (X-M,) + My. 
ox : 


Rewriting equation (4) in our notation we 


o 
Z, = Tsp — (Z, ‘ Mz,) + Mz.. 
- 


Pp 


Since Mz,, Mz, = 0 ando,,¢, = 1, equation (5) reduces to 


p 


Z. = Ze Yep, (6 


and this is of the linear form 


Y =mX +b. (7) 


It is obvious that Z, is the slope of a line and the ordinate intercept is 0. Suppose 
we have three patients: one is very sick, categorized as ‘severe’ and has a score 
Z, = 4; another is moderately sick, categorized as ‘moderate’ and has a score, 
Z, = 0; the third is not very sick, categorized as ‘mild’ and has a score, Zp = —4. 
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We construct a profile graph by arbitrarily placing the limits of the test di 
butions at, say, +5Z and label the ordinate on the left accordingly. The limit 
the abscissa are 0 and 1. The subtest ordinates are drawn in according to the 
relation of a subtest with the perfect test. This has been done in figure 2. The scores 
on all subtests made by the ‘severe’ patient are predicted from 


r . w 
Ze = 4 Pep. (3) x 
Q 
Che subtests’ scores by the ‘moderate’ patient are predicted from 
Z, = 0. (9) N 
The subtests’ scores by the ‘mild’ patient are predicted from 
Ze = —4 Yep. 
These predicted profiles are the plotted linear functions of figure 2. 
. . + . . a . --* si 
Che equation for the standard error of estimating Y from X is 
yx = Fy \ l = Txy- 
Equation (11) written in our notation becomes 
°Z,2y ” \ 1 - a (12 
since % = 1. The right side of equation (12) is the coefficient of alienation. 
It is obvious that one can calculate °7,z,, for each subtest and then plot a é 
curve through these points. One could plot many curves, say, for + .5 %7,2,) + Loz z 
- 2 77,2,» te. We have plotted + .675 CZ) which corresponds to a probability, P, 
of .50. Within the non-linear bounds of figure 2, an empirical profile should fall 
50° of the time for each of the indicated patients 
The dotted line of figure 2 is the profile plotted from the data in table 1. If the 
linear function denoted ‘severe’ is regarded as a best fit for the dotted profile, then c 
equation (8) is the equation for the dotted profile. 
Another mathematical description of the dotted profile is given by an equation 
for a broken line as follows: 
—— ( 
4 Yep - 
{ 
{ 








Any line can be described by the equation 


Y=a-+bX + cX? + dX'-+.... 
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Ficure 1. ConvenTIONAL ProFILe Con- Ficure 2. New Meruop or ConstRuctTiInG 
STRUCTED FROM Data IN TABLE 1. PROFILES. EXPLANATION IN TEXT. 








os O 


Ficure 3. Prorites in’ Mopiriep PoLar Ficure 4. Prorre in Mopiriep PoLar 
COORDINATES. EXPLANATION IN TEXT. CoorpInaTEs. Dorrep LINE PROFILE Is THE 
PROFILE OF FIGURE 1. EXPLANATION IN TEXT. 


Since for a given set of points an infinite number of different curves can be 
drawn, some criterion is needed for a ‘best’ curve. We suggest that such a criterion 
might well be: that equation of lowest degree which defines a line that passes through 
all the points and that satisfies the last squares criterion in some specified manner. 
This criterion is faulty in that it does not specify in just what way the least squares 
criterion will be applied. This may be decided on the basis of the degree of pre- 
cision required or obtainable in practical situations. 


There are other mathematical representations or descriptions of a line, but the 
ones given previously seem to hold the greatest promise for practical application 
to the phenomena of profiles. There seems to be a remote possibility that the repre- 
sentation and analysis of profiles in the future might be facilitated by recourse to 
the harmonic analyzer, an instrument used in the physical sciences to study oscilla- 
tory phenomena, or some such instrument yet to be devised. If such ever becomes 
possible, the machine will yield the necessary constants for the profile. Pending the 
advent of such a machine, an interesting problem presents itself when one considers 
the applicability of Fourier series and Fourier integrals to the problem of profiles. 
The Fourier integral approach seems to hold considerable promise. 
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Profiles in Polar Coordinates. The previous discussion has been restricte: to 
profiles depicted in rectangular coordinates. An interesting innovation occurs when 
profiles are represented in the polar system. 

The polar system requires that one choose an origin or pole and then pass a 
fixed line through it. This fixed line is the polar azis or initial line. A point, P, has a 
radius vector, rho, and a vectorial angle, 8, as its coordinates. There are 360° around 
the origin. One can translate theta into rsp from 


8 = 3.6 rsp. (15) 
The radius vector, rho, is defined by the subtest scores. Any point, P, is defined then 
not by (rho, @) but by (Zs, rsp). Figure 3 shows the three patients’ profiles plotted in 


this modified polar system. These curves are the old spiral Archimedes; the polar 
equation for such curves is 


rho = a. (16) 
Our modification of the polar system permits us to rewrite the polar equation (16 
Ze = OTap, (17) 


as 


where a is the slope in rectangular coordinates. 


The modified polar profiles were constructed as follows: The pole is the mean 
Z, score, that is, zero. The maximum radius vector is equal to the subtest limits, 
that is, 5 Z,.. The angular quantity, 9, is translated into r,p, that is, the values of 
the correlation are laid off around the circumference of the circle delineated by the 
maximum radius vector sweeping around the pole. 

Figure 4 shows the ‘severe’ predicted profile as the midmost solid line. The 
outer solid lines are the bounds within which an ‘empirical’ profile for an individual 
having Z, = 4 should fall 50% of the time. The dotted line is the profile of Fig- 


ures 1 and 2. 


DIscussION 


Many questions have arisen in the reader’s thoughts regarding the possibility 
of practical application of the ideas presented here. No definite answer will be given 
to many of these questions but some indication will be given to show that the ideas 
presented in this article may have empirical relevance. 

Normality of the test distribution. Any continuous distribution can be transformed 
into a normal distribution and meaningful Z scores can be derived for any normal 
distribution. 


Number of items in a test. This offers no special problem. A practical definition of 
“‘a large number of items” might be, say, that the number of items in a subtest be at 
least 50 or 60. 


Linear and positive correlation. If correlation is not linear a transformation can al- 
ways be found which will make it so. All negative correlations can be made positive 
by simply reversing the subtest scales. 


Profile segments as Continua. When subtests have a very large number of items, 
intermixing of the items would yield a very, very large number of hybrid tests, each 
different from the other. If an individual got 80% of the items on one test ‘correct 

and 20% on another test, then if a hybrid test composed of an equal number oi 
randomly selected items from the two original tests were constructed, the percentage 
of items gotten correct by the individual on the hybrid test would tend to be lower 
than 80°,, but higher than 20°. The functional relationship of the hybrid tests 
scores with r,> would depend on many things: scoring or weighting method, types 
of original tests, ete. However, since the number of different hybrid tests would be 
extraordinarily large, it seems that theoretically one could select a set of hyb 
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tests such that the hybrid test scores were a linear function defined by the two 
score points of the original tests. If the number of hybrid tests was fairly large, then 
a profile segment could be regarded as a continuum. 

Since the two original tests would be considerably different, the above rationale 
applied to all profile segments would lead to what the present writer“? has called a 
Catinwum; an equation for the profile would also be an equation for the profile as a 
catinuum. 


Correlations or Factor Loadings? The abscissa of the profile graph might be in terms 
of a factor, in which case the limits could be 0 and 1, and a subtest ordinate would 
be drawn perpendicular to the abscissa at the point indicating the factor loading 
of the subtest. It seems that the profile would always have positive slope when the 
factor loadings of the subtests were all positive. The factor might take the place of 
the perfect criterion test and the factor loadings take the place of the correlations, 
rp. Lhe general nature of the profile curves has not been worked out, but it seems 
that this problem is capable of a rational solution. 


Profiles in Polar Coordinates. Rectangular coordinates may not always be the clear- 
est way to discriminate between different profiles. Also, certain problems can be 
handled in the form of polar equation more easily than in other forms. Since the 
present writer cannot foresee all the possible implications of either system, it seems 
that the polar system may suggest other problems and solutions that the writer does 
not perceive. 


Perfect test and profiles. The question might be raised as to the utility of a profile 
composed of subtests when a single perfect, near-perfect, or very good test is avail- 
able which yields a single score for each individual. We present the following argu- 
ments in favor of the profile even when a single good estimate is obtainable. First, 
there is no reason why the profile cannot be used to supplement a single good estim- 
ate. Second, there are many tests composed of subtests which yield a single total 
score and also a profile for the subtests. Psychologists tend to prefer the profile in 
preference to the single score. The Wechsler IQ and profile are examples. Third, 
since in empirical work perfection will never be obtained, a single test with high re- 
liability and validity might well be considered as our previously mentioned ‘perfect 
test’ and a profile constructed from a set of correlated subtests used to give a new 
estimate of the reliability and validity of the ‘perfect test’. Fourth, a different kind 
of clinical impression is obtained from a profile. The single score does not give the 
same impression. Fifth, a profile or line permits quite different mathematical an- 
alyses to be made. A point (a single score) has certain serious mathematical limita- 
tions. Sixth, the present interest in psychology is in clusters or configurations rather 
than single measurements. The syndrome itself, of major importance in clinical 
psychology, is generally conceived of as a configuration of attributes rather than a 
single dimension along which individuals can be ordered. The ‘single dimension’ 
concept may be reinterpreted as a unidimensional continuum which is, however, 
composed of or generated from several other dimensions. There is no reason why 
one cannot choose to use either or both the unidimensional multivariate continuum 
as well as the separate dimensions. 


The perfect test. The perfect test will, of course, never be empirically obtainable. We 
can suggest several ways of dealing with the problem. First, it may be possible to 
construct a highly valid and reliable test which can then be regarded as an approxi- 
mation to the perfect criterion. Second, a total score may be obtained from the sub- 
tests and used as the perfect test. Third, some type of factor may be located which 
indicates the saturation of a test with the factor. The perfect test would result from 
constructing a test which is highly saturated with the factor. Fourth, a unidimen- 
sional multivariate continuum could be generated from the subtests themselves. 
This is mathematically more precise than many of the present methods of obtaining 
a total score by arbitrarily weighting subtest scores. Fifth, the perfect test may be a 
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subjective continuum. For example, a judge or group of judges are asked to us 
normal distribution as a frame of reference and estimate a Z, score for ea 
dividual. Sixth, a measure which is or is thought to be very highly correlated 
the attribute under consideration might be used as the perfect test. 

This discussion is highly speculative. It should be regarded only as a sour 
ideas. The reacer should be extremely cautious in attempting to apply these con- 
cepts until he has traced out the statistical, mathematical and clinical implicatio 
ot our notions. 


SUMMARY i 

The syndrome is of great importance to clinical psychology. The syndrome is a 
complex, multivariate phenomenon. Syndromes differ from one another qualitatively 
and quantitatively. Syndromes may be analyzed subjectively or objectively. Math- 
ematical representation and analysis of the syndrome is one approach to syndromic 
problems. Because of the power and simplicity of mathematical language and math- 
ematical models, an attempt is made to talk about syndromes in this way. 

A more conventional approach is to conceive of the uncorrelated dimensions 
of personality as the dimensions of an n-dimensional sphere. An individual or a 
syndromic group is represented as a point in this hypersphere. Equations for the 
population hypersphere and a hypersphere for a syndromic group or an individuil 
were given. 

Psychologists like profiles. For this reason an attempt is made to rationalize: 
(a) a unique sequence for the subtests making up a profile, (b) unique distances be 
tween adjacent subtests in the ordered sequence, (c) a single continuum for the ab- 
scissa of a profile and (d) a continuum for the profile segments. In order to do this a 
set of assumptions are made which also permit the profile to be described math- 
ematically. Several equations for a profile in cartesian coordinates are suggested: A 
linear function, a broken line equation, that equation of lowest degree which defines 


a line that passes through all points of the profile and which also satisfies the least 
square criterion in some specified manner, and the possibility of describing and an- 
alyzing the profile in terms of Fourier series or Fourier integrals. It is suggested that 
profiles might be represented in modified polar coordinates. A polar equation is 
written for certain profiles. 
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INTRODUCTION 


This study is concerned with the exploration of a brief battery of quantitative 
tests to supplement the usual clinical evaluation of ‘“‘memory”’, and which could be 
ised in routine examination of the patient upon his admission to the hospital. 
Ideally, such a device should be relatively free of regional and educational factors 
and should not contain items in common with other tests “° @) which might be re- 
quired later as a further measure of memory function. 

The criteria of goodness of a memory battery are perhaps somewhat difficult 
to define, as memory itself does not have a unit meaning. However, such a battery 
should discriminate between groups of persons with and without ‘“‘memory defects” 
in the clinical sense, and should differentiate between clinical groups which would, 
in the psychopathological sense, be expected to show differences im invasion of the 
memory sphere. If sufficiently sharp, it should detect smaller areas or amounts of 
defect, perhaps not readily discernable at a gross clinical level, but nonetheless com- 
patible with the total clinical picture. 

For purposes of tentative definition, memory is here looked upon as the ability 
to respond at a particular point in time, on the basis of, or with reference to, stimuli 
which were present at a prior point in time. In order to test memory as thusly defined, 
it would be possible to use any of the classical methods of recognition or reproduction. 


MerrHop 


The original exploratory battery used in this study was composed of seven 
short tests, five of which are in current clinical usage, either as individual tests or as 
parts of other psychological tests. These were, in the order in which they were 
idministered : 


1. Pictures A-1 (Recall of pictured objects). This test is composed of drawings 
of ten common objects from the Mental Examiner’s Handbook“. The subjects 
were shown the pictures (objects common to men “».'°*) were presented to male sub- 
jects and those more known to women ‘?-*) were shown to the female subjects) 
and were instructed to name them. Immediately thereafter, the pictures were with- 
drawn and the subjects were requested (without prior warning) to recall as many as 
possible. This probably tests what is sometimes called “incidental memory.”’ 


2. The Navy-Northwestern Successive Matrices. This test was developed at 
Northwestern University by Hunt and French, who very kindly made it available 
for this exploratory study “’. Each matrix is composed of a series of discretely pre- 
sentedjgeometric stimuli which, if superimposed, would form a design. Three parts 
of the design are presented in rapid succession and the subject is then required to 
select, on a multiple choice basis, the fourth and correct one to complete the design. 
Fourteen such successive matrices form the test. Satisfactory performance on the 
series requires the adoption and maintenance of a “learning set’’, as well as the 
ability to ‘‘remember”’ the parts of the particular matrix. 

3. Symbol Recall. This test, designed by the authors, was incorporated in the 
battery as a visuo-motor equivalent of the auditory-vocal Digits Forward Test. Sym- 
bols from the Wechsler-Bellevue Digits Symbol Test? were formed into groups of 
from three to nine, by combining those symbols corresponding to the numbers ap- 
pearing in the standard Wechsler Digits Forward Test. Each series, printed on its 
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own card, was presented for five seconds; after each presentation the subject was re- 
quired to draw the symbols in the correct sequence. 


4. Pictures B (recall of pictured objects). A second set of ter. common ob- 
jects“*®) was presented to the subjects who were simply instructed to name them 
The pictures were withdrawn and the subjects were then directed to name them from 
memory. In this instance, however, the subjects had reason to believe, on the initia] 
exposure, that recall would be demanded. 


5. Digits Forward. This test was adapted from the Wechsler-Bellevue Scale, 
Form I“ and administered according to those directions. 


6. Pictures A-2. At this point, the subjects were asked, without prior warning, 
to name as many of the pictured objects originally presented in Pictures A-1 as was 
possible. Since the similar test, Pictures B had been interpolated between pictures 
A-1 and A-2, it was hoped that retroactive inhibition would operate to pick out thos: 
with greater susceptibility to the interpolation of similar material, i.e., those with 
poorer memory. 

7. The Benton Visual Retention Test (Form A). This is a standard test“)? in- 
volving memory for geometric designs. The subjects were presented a printed de- 
sign for ten seconds, the card was withdrawn, and they were then required to repro- 
duce the design (s) with paper and pencil. Seven designs of increasing complexity 
form the test. 


In each of the subtests, the score was the number of items to which correct 
responses were made. 

During one phase of the investigation, the Wechsler Memory Scale “° was ad- 
ministered concomitantly with the “‘Memory Battery’. A brief test of intelligenc 
level) was also administered, to all subjects. 


Subjects. Three major groups of subjects were used: 


Group 1 included 102 routine admissions to the Iowa Psychopathic Hospital! 
during the fall and winter of 1948-49, regardless of diagnosis; the sole criterion fo1 
inclusion was that the patient be sufficiently cooperative to be examined. The ag 
of the patients ranged from seven to seventy-nine years and formed a rectangular 
distribution, 


Group 2 consisted of 50 freshman student nurses, all of whom were volunteers 


from a class of 107. The age range was from eighteen to twenty-three years This 
group was randomly divided into twenty-five who were administered the memory 
battery as described above, and twenty-five who were administered the battery 
without Pictures B (in order to estimate the normal decrement on Pictures A-2 by 
virtue of the inclusion of Pictures B). This was not a “‘control group’’, the subjects 
being young, bright-average to superior adults. On @ priori grounds, they should be 
expected to exceed the patients on all tests. 


Group 3 was made up of 48 patients, similar to Group 1, admitted to the hos 
pital during the spring of 1949. In addition to the experimental memory battery, 
they were administered the Wechsler Memory Scale immediately after the experi- 
mentai memory battery, so that the latter might be compared with 2 standard 
“Memory ~ test. 


RESULTS 


Table 1 presents the mean and standard deviation for each of the subtests, for 
both the young normals (Group 2) and the patients in Group 1. In all subtests ex- 
cept Digits Forward, the normals exceed the patient group at or above the 5°; level 
confidence, as would be expected. The failure of Digits to discriminate the select 
bright normal group from the patients is of interest, in light of its widespré 
though non-standardized, use as a test of ‘‘memory”’ or ‘“‘attention’’. 
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LE 1. Memory Sustest MEANS AND STANDARD DEVIATIONS FOR CONTROL AND EXPERIMENTAL 
GROUPS. 











Nurses 
Total Group Group 
Group I II Patients 
Tests N=50 N=25 N =25 (N = 102) 








Pictures A Mean a | 5.9 
S.D. 1.8 








Matrix 





Symbol 











Mean 
8.D. 





Pictures A-2 Mean 
8.D. 


Visual Retention Mean 6.4 
8.D. 0.6 





When intercorrelations were run between subjects for the normal group, few 
were significant even at the 5% level of confidence; those with the highest level of 
confidence were the ones having the greatest a priori expectancies, namely: Pictures 
A-1 and A-2 (without interpolation of Pictures B), + .64; and Digits and Symbols, 
+ .44. The intercorrelations for the patient group were rather uniformly significant 
at a higher confidence level than those for the normals, probably as the result of the 
greater range of scores presented by this diverse group. 

Of particular interest in a battery of this type are correlations with chronological 
age and with mental level. Two of the subtests show significant (one per cent level) 
negative correlations with chronological age and significant (one per cent level) 
positive correlation with mental level (as defined by a previously described battery). 
These are the Navy Northwestern Successive Matrices (rh0c,y = —.40; rhom = +.53) 
and the Benton Visual Retention tests (rhoe, = —.40;rhom, = +.57). 

While nearly all of the tests used are inter-related, whether there is only one 
factor present in the statistical sense, is not at present known. 

Since one criterion of goodness of a memory test is the ability to discriminate 
those diagnostic groups with memory deficit from those without, the patient group 
was subdivided diagnostically into (1) psychoneurosis, (2) involutional and depressive 
psychosis, (3) schizophrenia, (4) organic psychosis, (5) psychopathic personality and 
primary behavior ‘disorder. Patients not falling into one of these categories were 
omitted from this present analysis. The organic patients were considered to be the 
criterion group. Table 2 presents the mean and standard deviation of the normals, 
and of each diagnostic group in the order mentioned above, for ‘“‘mental level’’ as 
well as each of the memory subtests. 
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The F-test (between psychoses—within psychoses), using the five diagnostic 
groups, indicated differences significant at the 5° level for Pictures A-1, Matrices, 
and Pictures B, and, at the one per cent level for Visual Retention. 

The differences between diagnostic groups were tested by ‘“‘t’’ on these four 
subtests and are presented in Table 3. It may be seen, by inspection of the figures in 
the preceding table, as well as by thesé data, that: (1) patients with organic signs 
were significantly below other groups at the 5 per cent level of confidence, or greater, 
on all four subtests; (2) patients with signs of depression were, at the 5 per cent 
level of confidence, significantly below those with schizophrenia on Matrices, and 
significantly below patients with psychoneuroses on Visual Retention; (3) Mental 
level, Symbols, Digits, and Pictures A-2 did not differentiate between the diagnostic 
groups; Digits failed to discriminate the patient groups from the superior normal 
group, while Symbols did not differentiate the normals from the psychoneurotic or 
schizophrenic groups. 


Tasie.3. “?’-Ratios oF DIFFERENCES BETWEEN D1aGNostic Groups ON SuspTests Pictures A-1, 
Marrices, Pictures }}, aNnpD VisuAL RETENTION 


Differences Pic A-1 Matrices Pictures B Visual Retention 


tPsychoneurotie-Organic 3.0383°* 2.016* 3.486** 3.735** 
iSchizophrenic-Organic 2.662** 3.313** 2.537* 3.284** 
iDepressive-Organic 3.664** 2.870** 2.526* 
‘Psychopathic personality-Organic 2.734** : , 2.868** 3.198** 
tSchizophrenic-Organic 


tPsychoneurotic-depressive 





* 


= 5% level of confidence 


** =1% level of confidence 

Pictures A-2, a test which it was hoped would be particularly useful and which 
was not, deserves some separate mention. It is to be noted that, in the normal 
sample, the group with Pictures B interpolated showed a significant drop on A-2 
while the group without B interpolated did not (Table 1). Thus, interpolation of 
Pictures B leads to greater forgetting of,-or ‘‘poorer memory” for, A. In the patient 
group, the decrease from A-I to A-2 was also significant. However, the difference 
between the decrements of the young normals and of the patients was not statistic- 
ally significant. It is now thought that Pictures A-2 was too difficult a test for all of 
the patients. What would happen on A-2 if practice on A-/ were increased beyond 
one trial (for example, if practice were given for three trials or perhaps to a criterion 
of one perfect recall) would seem to be worth investigating. Differential suscepti- 
bility to retroactive inhibition in the organic group might then be sufficiently spread 
to be demonstrable statistically. 

With regard to the criterion of goodness relative to prediction of memory 
deficit in individual patients, preliminary evaluation has been made by reviewing 
the mental status examination in each patient’s history and by an attempt to rate 
memory, as clinically judged by the patient’s physician, as being “‘intact’’, as *‘show- 
ing deficit’, or as “showing possible deficit.”” This was a difficult task, since the 
qualifying statements made with reference to memory might have meant that there 
was no particular involvement, or that there was reservation in the examiner’s mind. 
\n initially high criterion for ‘‘no deficit”? (that the report could not include quali- 
fications) was used. Any qualifications then placed the patient in the doubtful or 
“possible” category. 
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Out of the entire group of patients, seventy-three were said clinically to | 
intact memory. By using a cutting score of one point below the patient averag: 
three or more of the subtests which discriminated organic from non-organic patie: 
(i.e., Pictures A-1, Matrices, Pictures B, and Visual Retention,) sixty-four were « 
rectly assigned by the battery. Nine of the seventy-three patients would be susp: 
of memory loss by this quantitative criterion. These may be called ‘‘false positiv: 

Twelve of the patients had an unmistakable memory loss from the standpoi 
of the clinician. Of these, nine were correctly assigned by the three-out-of-f 
criterion. Of the three incorrectly assigned (the “false negatives’), one patient 
below on two tests, one was below average on one test, and the other was above ave 
age on all four tests. This last patient had involutional melancholia, was agitate; 
at the time of the clinical examination, but cooperated very well at the time the 
quantitative psychological battery was administered; hence the correlation of the 
original impression and the test data delt with two different conditions of behavior, 
and may be considered invalidated. 

Sixteen of the patients were judged, by qualifying statements made by the 
physician, to have some possible invasion of memory. Of these, nine appeared to 
be below average on three of the four critical subtests. Seven were ‘‘false negatives” 
which were again reviewed, and in four cases it was found that the clinical qualifica- 
tion was: ‘‘memory seems fair’. When these were deducted from the total of quest- 
ionable, and definite but slight, losses, the correct assignments were made in nine out 
of twelve cases. Thus it appears that, when no memory deficit was found clinically, 
there was 87°; agreement with the four subtest criterion. When a positive memory 
deficit was clinically described, there was 75°% agreement. It might be added, par- 
enthetically, that there is probably no more reason to regard these agreements as 
validating the quantitative findings than to regard the clinical data as being vali- 
dated by agreement with the test results. 

An additional group of patients was studied (Group 3) for comparative and 
v: = ating ee s, using the expe rimental and the Wechsler Memory Scale. The 
mean Wechsler M. Q. was 99.8 +15.9. 

"The correl: oath between the Wechsler Memor y Scale and the four critical sub- 
tests (Pictures A-1, Matrices, Pictures B, Visual Retention) were +.72 between sum 
of raw scores of the W. M.8.; and +.78 when Digits was removed from the W. M.5 
Since the “memory quotient” derived from the Wechsler raw scores seems to be a 
non-linear transformation it was not used in the correlations, the raw score being 
used. 

The data indicate a reliable degree of relationship between the two instruments 
Most of this relationship seems to be accounted for by three of the four critical sub 
tests: Matrices, Pictures B, and Visual Retention. With the exception of Digits, 
correlations between the subtests and the Wechsler Scale Were significant at the o1 
per cent level of confidence. 


SUMMARY AND CONCLUSIONS 


This paper reports the exploration of a brief battery of seven short “memory 
tests devised for use in the routine examination of mental hospital patients. Result 
ire presented from a control group of 50 student nurses and from two experiment 
groups consisting of 102 and 48 routine admissions to the Iowa Psychopathic Hos- 
pital. 

(1) Pictures A-1 and B, the Matrices, and the Visual Retention test, discrin 
inated a criterion group (organic) from other diagnostic groups. Conversely, Ment 
Level, Symbol Recall, Digits Forward, and Pictures A-2 did not so discriminat: 
Pictures A-2 might be made more selective by changing the method of administrati: 

(2) Matrices and Visual Retention both correlate relatively highly with Ment 
Level, which is itself not discriminatory, although they themselves are. This wou 
suggest that these two tests may be related to “intelligence” and ‘‘memory”’ 
differentiated processes. 
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(3) Matrices, Visual Retention, and Picture Recall may collectively serve to 
screen patients with clinically-defined memory deficits. At present the number of 
“false positives” seems relatively lower than the number of “false negatives’’, but 
agreement is, at the lowest, about 75 per cent between clinical assessment and test 
data. 

(4) These four tests may be useful in forming a nuclear group for the develop- 
ment of a memory battery. It should be pointed out that these tests are all visual 
(as opposed to items in the Wechsler Memory Scale which are, with one exception, 
all verbal). The correlation between the experimental battery and the Wechsler 
Memory Scale, nevertheless, was quite high: r = +.72 to +.78, suggesting that, 
clinically, ‘‘memory”’ is not particularly related to stimulus modality and is prob- 
ably a discrete process. 


REFERENCES 
Benton, A. L. A Visual Retention Test For Clinical Use. New York: Psychological Corporation, 
New York, 1946. 
FELDMAN, F. and Cameron, D. E. The Measurement of Remembering. Am. J. Psychiat. 1944, 
100, 788-791. 
Frencu, E. G. and Hunt, W. A. The Navy Northwestern Successive Matrices Test. Am. 
Psychol., 1949, 4, 268-269. 
GranaM, F. K. and Kenna, B. 8. Performance of Brain- Damaged Cases on A Memory for 
Designs Test. J. Abn. Soc. Psychol., 1946, 41, 303-314. 
GranaM, F. H. and Kenpaui, B.S. Memory For Designs Test. J. Consult. Psychol. 1947, 11, 
340. 
Jounson, H. K. and Lassner, R. Organic Deterioration. The Use of the Worcester-Wells 
Memory Test for Differential Diagnosis. Psychiat. Quart., 1945, 19, 283-194. 
Knorr, J. R., Cannicort, R. G., Umpercer, J. P., and Brroprav, I. McD. Brief Tests of 
Intelligence in the Psychiatrie Clinic. J. Clin. Psychol., 1951, 7, 123-126. 
Srong, C. P., Grrpner, J., ALBRecuT, R. An Alternate Form of the Wechsler Memory Scale. 
J. Psychol., 1946, 22, 199-206. 
WecustsEr, Davin. The Measurement of Adult Intelligence. Baltimore: Williams and Wilkins, 
Baltimore, 1944. 
Wecuster, David. A Standardized Memory Scale for Clinical Use. J. Psychol. 1945, 19, 87-95. 
We tts, F. L. Mental Tests in Clinical Practice. Yonkers: World Book, N. Y. 1927. 
We ts, F. L. and Martin, H. A Method of Memory Examination Suitable for Psychotic cases. 
Am. J. Psychiat., 1923, 80, 243-257. 
We ts, F. L. and Rugscu, Jercen. Mental Examiners’ Handbook. New York; Psychological 
Corporation, 1945. 





A FACTOR ANALYSIS OF THE BENDER-GESTALT TESTS OF 
MENTAL PATIENTS* 


WILSON H. GUERTIN 


Beatty Memorial Hospital, Westville, Indiana 


INTRODUCTION 


The Bender-Gestalt Visual Motor Test has been extensively used since its 
introduction in 1939), but discussions of the test and manuals for interpretation 
are based largely upon clinical experience and depend relatively little upon published 
research with the instrument. This would seem to be largely a consequence of the 
difficulties in quantifying such a complex performance as is seen in the reproductions 
of the Bender-Gestalt designs. 

Billingslea“?, in his very painstaking objective scoring method, demonstrates 
the extraordinary number of variables which are required to characterize these pet 
formances. It took him fifteen hours for each record, scoring on 137 indices which 
described the test performance. Billingslea recognized the need for reducing the 
number of indices and proceeded to do this by studying the consistency in the cor 
relations between analogous performances on different figures to see what general 
principles could be substituted for the individual indices of the separate figures. 
Through the use of an experimental group of psychoneurotics, he was able to draw 
some conclysions about the performance of such subjects. This study by Billingslea 
does not disclose much about factors which might underlie the performance of in- 
dividuals who are more disturbed than the subjects employed in his study. Actually 
very few studies have been reported in the literature which investigated the per- 
formance of schizophrenics on the Bender-Gestalt test(*). 

It is quite apparent that the Bender-Gestalt Test records must be examined by 
the clinician through the application of a reduced number of useful factors. The 
intuitive establishment of these factors has depended upon Gestalt theory and were 
set, forth in the original monograph®?. More recently, Hutt“ has done an excellent 
job in setting forth factors which should prove useful in a systematic interpretation 
While the Gestalters are content to establish the conceptual frame-of-reference by 
an appeal to the general Gestalt theory, more objective clinicians would prefer to 
see these concepts established on the basis of empirical] clusterings of variables. 

Determination of the most important factors underlying Bender performances 
of all types of mental illness is an almost impossible task. Not only would the 
sampling of subjects have to be very extensive but the number of scoring variables 
included would make the task a tremendous one. The present investigation is in no 
way a complete study but is merely an application of factor analytic techniques to 
this general area of visuomotor performance. 


PROCEDURE 


The subjects employed were 100 white patients at three mental hospitals’, in- 
cluding 33 females and 67 males. For statistical convenience the subjects were char- 
acterized by four classifications. The distribution of subjects by diagnostic groups 
were: 15 Not Psychotic, 23 Organic Brain Diseased, 29 Paranoid Schizophrenia, 30 
Other Schizophrenia, and three unclassified psychosis. 

The original quantification variables were selections from, and modifications 
of, those employed by Billingslea®?. While each Bender production was examined 
with respect to the 60 variables initially employed, it was apparent that the fre- 
quency of occurrence for many of these was so low that they could not Le studied 


*The author wishes to acknowledge the cooperation of the Superintendents and staff at Beat 
Memorial Hospital, Kalamazoo State Hospital and Indiana Hospital for Insane Criminals in making 
the data available. 
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TABLE 1. Opstique Facror Matrix* 


Factors 


Variables Figures 


Propensity for 
Poor Reality 
Constriction 
Poor Spatial 
Contiguity 


Movement 
Contact 


Distortion of No. of waves 
Tail are poor 

Non-closure 

Handle angle poor 

Part rotation 

Loss of angles 

Number distortion 

Waves flattened 

Crossing point wrong 
Tailing distortion 
Columnar spacing irregular 
Cireularity poor 


Rotation > 90° 

Figure order poor 
Cirele-period substitution 
Line quality irregularity 
Vertical slope irregularity 
Horizontal slope irregularity 


No compulsive restroking 
Linear regularity poor 
Vertical slope wrong 
Pointed loops 

Squareness poor 

Major are radius wrong 
Erasure present 


Undersize figures . 

> 1 page 

No heavy lines 

Oversize figures 

<1 page 

Lip approximation 

Handle attachment wrong 
Disproportion of elements 


Angle between elements poor 
Overlap of elements 
Juxtaposition poor 

Crossing angle poor 


Fig. extend to paper edge 20 
Not oversize 2 —20 
Rotation <90° 11 05 
Page balance poor 01 -—02 





*Decimal points omitted. 
**Based upon orthogonal matrix not reported here. 
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statistically, and so they were dropped from the study. Only those items which .p- 
peared in more than 19 per cent of the cases in the tail of the distribution were re- 
tained. Six of the variables were dropped because of obvious experimental depeid- 
ence. The final 41 variables employed bear some resemblance to the original /jl]- 
ingslea indices. For editorial convenience these items will not be defined in detail 
and for those terms which seem vague or ambiguous, reference can be made to the 
word usage in Billingslea’s monograph, except as otherwise indicated. To sum- 
marize, three general principles were operating in the retention of variables to be 
studied. They were: ease of scoring, lack of experimental dependence on other var- 
iables in the scoring, and frequent enough occurrence in this sample of subjects. An 
additional three items met these criteria satisfactorily but had such low communal- 
ities that they were dropped in the final analysis. 

It was hoped that factors would be revealed which would be dependent upon 
gross inaccuracies in the records since the clinician seldom has the time to quantify 
his records carefully. Furthermore, it was felt that the productions of subjects, as 
disturbed as those employed in this study, disclosed sufficiently gross distortions so 
that more exact metric measures were not exactly necessary. 

The data sheets on each subject were transferred to McBee punchcards for ease 
of tabulation. Tetrachoric correlation coefficients were graphically estimated from 
Thurstone’s tables. The multiple-group centroid method of factor analysis®) was 
employed because it is time-saving and likely to produce empirically meaningful 
factors. The clusters which determine these multiple-group factors were identified 
by a conventional method which produces fairly tight and relatively independent 
clusters. Estimates of communalities were obtained by using the highest coefficient 
in each column. The communality accounted for by each of the variables was de- 
termined from the orthogonal matrix obtained by rotating the oblique factor matrix 
herein reported. 

RESULTS 

The cluster analysis was somewhat discouraging at first because of the difficulty 
in identifying tight clusters which were relatively independent of one another. This 
seems to be a consequence of the absence of high coefficients in spite of the fairly 
large number of moderately high coefficients which were present in the intercorrela- 
tion matrix. Nevertheless, six clusters were identified and six oblique factors were 
extracted simultaneously. This resulting six-factor oblique matrix for symptoms is 
shown in Table 1. 

The total variance accounted for by the six orthogonal factors constitutes 52 
per cent of the total variance of the intercorrelation matrix or 93 per cent of the 
estimated communality for these 41 test variables. The estimated communality, by 
taking the highest correlations in each column, was 56 per cent of the total variance 
of the intercorrelation matrix. 

Factor loadings for diagnostic groups are shown in Table 2. It will be noted 
that only four correlations greater than .30 appear in the matrix. Table 3 shows the ‘ 
intercorrelations between the six oblique factors (Rpq). The correlations are rather 
high and give seme additional information about the nature of the factors. 


Discussion oF RESULTS 

Much of the Bender-Gestalt performance can be understood in terms of factors 
since the communality of the intercorrelation matrix is greater than half of the total 
variance of the matrix. This makes it possible to employ clusters of scoring items 
(i.e. factors), instead of single items, in analyzing the performance of mental patients 
on the Bender-Gestalt. 

The location of reference axes is always equivocal. The application cull 
ventional cluster analysis to locate the factors as well as the correspondence 
these factors and general psychodiagnostic and Gestalt principles, argues io: 
satisfactoriness of the positions of these reference axes. 
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TaBie 2. Facror LoapinGs For D1aGNostic VARIABLES 


Diagnostic group 


Factor Non-Psychotie Organic B. D. Paranoid Other Schiz. 
Schizophrenia 
Propensity for 00 -.10 —.05 24 
Curvilinear 
Movement 


Poor Reality -.27 -.39 .25 .09 
Contact 


Careless —.07 —.16 —.09 13 
Execution 


Constriction .30 —.04 05 —.22 

Poor Spatial 25 —.33 -.14 .24 
Contiguity 

Unidentified .35 -.20 00 .28 





Factors* 


A B ( D I I 
A 1.000 .480 .650 211 437 .258 
B .480 1.000 .313 .342 .367 .043 
C .650 313 1.000 .433 .490 177 
D 211 .342 .433 1.000 147 .197 
E .437 367 190 147 1.000 .143 
F 258 .043 177 197 143 1.000 


*The capital letters symbolize the oblique factors in the order of their presentation in the text. 


It-should be emphasized that these factors were obtained from a disturbed 
group of patients. Very few of the productions were from individuals with what 
might be termed mild neurotic disorders. This will tend to disclose factors which 
depend chiefly upon such things as poor reality contact, but the sampling is such as 
to exclude other possible factors such as one attributable to anxiety. A factor analy- 
sis of a neurotic sample would be essential before anything like a complete conceptual 
scheme of test evaluation could be proposed. 

These six factors have been named inductively by considering the symptoms 
which showed the highest factor loadings: 


The Propensity for Curvilinear Movement Factor is characterized by distortion 
in figures due to a disturbance in curvilinear movement. This is reflected on the one 
hand by an excessive number of waves in figure 6, and on the other by rotational 
tendencies seen in actual rotations or more particularly in rotations of parts of the 
figures. Non-closure of elements is also important here and seems to be based chiefly 
on the fact that the non-closures in this sample were almost entirely related to diffi- 
culty with the curvilinear element of figure 4. Underlying this clustering of test 
variables would seem to be a poor emotional control which makes it difficult for the 
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individual, having perceived the Gestalt, to maintain ii. The flux or instability 

where this factor ‘s operating can be observed clinically in schizophrenics particu 
ly of the catatonic subtype. It will be noticed that a factor loading of .24 for Ot 
Schizophrenics was the largest loading for any of the diagnostic groups. This fa 


seems to be quite important as evidenced by the sizable loadings on an apprecial)le 


number of items. Bender, herself, makes much of this as is seen in her stateme 
“the inherent quality of all optic images is movement.’’®: »» * This movem 
tendency is a strong feature of most Gestalt principles and is seen by Bender a 
others as being basic to all visuomotor organization. The primitive features of suc] 
movement are found genetically in the view that small circular movements are tly 
elements upon which a child bases the later development of more complex perceptu 
movements. Hutt“ »- ' discusses modifications of curvature as a basic factor 
the modifications of the Gestalt. He attributes such modifications to deviations 
emotional functioning. The present findings support such a view. 


The Poor Reality Contact Factor is one which is familiar to all who work with 
psychotic patients. Among the important items here are the inability to maintain 
sufficient reality contact to reproduce the figures in even an approximate fashio 
and associated features which reflect this aspect in an inability to perceive or me 
the general requirements of the environment. The so-called regressive circle-period 
substitutions are encountered as an important feature of this factor. The Paranoid 
Schizophrenics show the highest positive loading for this factor as might be antici 
pated, while non-psychotics are negatively loaded with this factor. The highest re- 
lationship was for the Organic, Brain Diseased group who were loaded ~.39 on this 
factor. This would suggest that the Organic Brain Disease cases are capable of fairly 
accurate perception of the stimulus material with errors confined to smaller execu- 
tional aspects of the test. It would seem worth reporting the individual correlatio: 
between diagnostic types and the rotation of the figure 90 or more degrees. Thesi 
correlations give support to Hutt’s views“: »- * on the importance of rotations i 
psychoses. The correlations with Rotation are as follows: Not Psychotic, —.25; 
Organic Brain Disease, —.33; Paranoid Schizophrenia, .34; and Other Schizophrenia, 
.06. In the case of Rotation it appears that there is am element of active distortion 
present rather than just disorientation since the correlation is highest for the Para- 
noid Schizophrenics. 


The Careless Execution Factor is characterized both by a lack of care taken iu 
reproducing accurately and in compensating for such inaccuracies by such things 
as restroking. This factor would be important for careless individuals who are not 
ego-involved in their reproductions either because of a character disorder or psycho- 
sis. As far as is known, such a factor has not been formally proposed in spite of its 
obvious importance. Poor motivation or impulsiveness can exist in almost any type 
of case so it is not too surprising that the loadings for each of the diagnostic groups 
on this factor are all quite small 


The Constriction Factor embodies the well-affirmed psychodiagnostic principle 
that insecure people will constrict the magnitudes of their movements in a forma! 
testing situation. This factor is largely composed of size variables which have vary- 
ing degrees of experimental dependence. The only heavily loaded variable which is 
not concerned with size is the absence of heavy lines. The whole picture, then, is 
one of timidity with resulting constriction. So much attention has been given 
this factor in the literature that it seems unnecessary to dwell further on it. 


The Poor Spatial Contiguity Factor indicates a difficulty in juxtaposing the tv 
elements of the figure properly. The factor is characterized nicely by Hutt as, “dill 
culty with crossings, and closures’ “ »- ™) and is ascribed to psychoneurotics. It 
seen by the diagnostic loadings that on the one hand Organic Brain Disease is neg 
tively related to this factor, while on the other, the Non-Psychotics and Other Schi 
ophrenics are loaded positively with this factor. The psychoanalytic interpretati 
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of symbolic representation of conflict when two elements are juxtaposed may have 
some confirmation in these findings. 


The Unidentified Factor is composed of only three significantly loaded items. 
The nature of this factor is wholly unclear and the number of items loaded are 
relatively few. It seems desirable to disregard this factor in the present study and 
await further data before trying to understand it. 

Five group factors seem to be fairly well established by the present study but 
probably represent a relatively small portion of the total common factor space that 
could be identified., A better picture of this factor space could be obtained by study- 
ing all aspects of test performance for all types of subjects. Breaking the general 
area of diagnosis down into separate factor analyses on samples of organics, schizo- 
phrenics and psychoneurotics would seem to be desirable. Nevertheless, it is difficult 
to see how a similar extensive evaluation of the numerous performance indices can 
be made on the Bender-Gestalt test itself. A factor analysis of each figure would be 
a more thorough attempt at understanding the principles involved but would ignore 
interrelationships between figures. It would be necessary therefore to do an addi- 
tional analysis of the more general aspects of the test in relation to the factors re- 
vealed on each of the figures. ‘This would mean a total of 10 factor analyses, one for 
each figure. These 10 factor analyses should probably be repeated with the three 
separate diagnostic groups in order to provide a thorough study making a total of 30 
factorings in all. 

In conclusion, it is felt that factor analysis proves to be quite a laborious pro- 
cedure for establishing a conceptual scheme for interpretation of the Bender-Gestalt 
but the approach has been demonstrated in principle and some contributions relative 
to the empirical clustering of performance variables have been made. 


SUMMARY AND CONCLUSIONS 


1. The quantification procedures of the Bender-Gestalt in the literature sug- 
gested the possibility of factor analyzing the objectively scored test variables in an 
effort to see which ones clustered together. The study is a restricted one because of 
the large number of possible variables which might have been employed. 


2. ‘Fhe Bender reproductions of 100 mental hospital patients were evaluated 
in terms of a number of semi-objective scoring indices. The intercorrelation matrix 
of these variables was factored with the multiple-group centroid factor analytic 
technique. Factor loadings are presented for 41 test variables and four gross diagnos- 
tic categories. 

3. The five oblique factors obtained which seem to have meaning were as 
follows: Propensity to Curvilinear Movement, Poor Reality Contact, Careless Execu- 
tion, Constriction, and Poor Spatial Contiguity. 

4. The application of factor analysis in establishing a conceptual scheme for 
the interpretation of Bender-Gestalt productions is illustrated and the difficulties in 
terms of the labor involved in a more thorough study are discussed. 
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INTRODUCTION 


Although Rorschach interpretation has long been based upon several empirical 
assumptions about the meaning of such scoring categories as C, F, and M, relatively 
little experimental work has been done in regard to discov ering whether or not there 
are characteristic psychological or physiological correlates of these factors. Thus 
the Rorschach test, after more than 25 years, is still largely neglected from the stand- 
point of experimentally validated perceptual and personality correlates. A problem 
preliminary to discovering the exact personality correlates of the factors in question 
is to determine whether the Rorschach categories are actually me aningful to per- 
ception in general, and whether a subject’s perception tends to remain typical in 
other situations which would elicit behavior that could be scored in terms generally 
comparable to those of the Rorschach situation. As Bruner“? says, ‘Ink blots are 
no separate species of perceptual stimulus.” 

In a preliminary study Ulett“) reported the various subjective sensations 
noticed by subjects stimulated with flickering light. His control subjects reported 
seeing color, movement and patterns in response to the flickering white light and a 
few had well-organized visual illusions. The reported patterns changed with altera- 
tion of the stimulus in regard to intensity or rate of flicker... In addition to these ex- 
periences, some psychiatric patients reported that symptoms related to their illness, 
such as dizziness and falling sensations, were experienced during the exposure to 
flickering light. It was noted that there seemed to be a frequency specificity in var- 
ious patients and that when a critical rate of stimulation was reached, driving of the 
EEG pattern was enhanced and the subjective sensations were more marked. The 
relation of such phenomena to personality factors was investigated by Ulett? who 
found that it was possible to discriminate fairly well between normal and anxiety- 
prone individuals according to the kind of sensations experienced under photic stim- 
ulation. Anxiety-prone individuals were more likely to develop feelings of dysphoria. 

Since the subjects in the above studies of intermittent photic stimulation re- 
ported their induced sensations in terms which could be scored in a way comparable 
in some respects to the categories scored in the Rorschach situation, and since emo- 
tional and psychiatric symptomatology were evoked by a stimulus producing simul- 
taneously a change in their brain wave patterns, it seemed that here possibly was a 
way of relating determinants of perception utilized in the Rorschach test to the 
modes of perception in another situation in which neurophysiologic parameters were 
more accessible. 


SUBJECTS AND PROCEDURE 
In the present study 181 male subjects were used, 149 of whom were volunteers 


with a minimum of eighth grade education drawn from the Air Force reserves, regu- 
lar Army personnel, university and seminary students. The remaining 32 subjects 


*This report was completed under AF Contract Number 33(038)-13884. It is one of a series 
studies under D and R Project 21-37-002, Development of Psychiatric Screening of Flying Personn 
Department of Clinical Psychology, United States Air Force School of Aviation. Medicine. 

**The authors wish to express acknowledgment to Miss Ann Lawler who administered and 
seored all Rorschach tests, to Dr. George Winokur who did the psychiatric screening, and to M 
Rosemary Baessler who conducted the intermittent photic stimulation experiment. Thanks are also 
due to Dr. Goldine Gleser for statistical consulation. 
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were patients of similar age, sex and education selected from the hospital and clinic 
of the Department of Neuropsychiatry of the Washington University School of 
Medicine and Barnes Hospital. These latter had been diagnosed as having anxiety 
neurosis or some psychiatric disease in which anxiety was a major feature. All sub- 
ects were part of a larger study of anxiety-proneness? and hence had been care- 
fully screened both psychologically and psychiatrically for evidence of personality 
disturbance. All were subjected to a three-hour battery of psychological tests in- 
cluding the Rorschach, and were, in addition interviewed by a psychiatrist for one 
hour. For both the aforementioned situations they were rated on an 8 point scale for 
anxiety-proneness. Ratings of 1, 2, and 3 were considered normal, 4 and 5 border- 
line, and 6, 7, and 8, abnormal. Where the psychologist’s and psychiatrist’s ratings 
were at variance, a conference was held and a group rating decided upon, this rating 
isually being an average of the two original ratings. As a result of such screening, 
133 individuals were distinguished as being free from any significant anxiety (i.e., 
rated 5 or below). Sixteen of the ‘normal’ volunteers were considered to be very 
prone to develop anxiety under stress (rated 6, 7, and 8). The third group consisted 
of the aforementioned 32 psychiatric patients. 

The Rorschach test was administered first in all cases and, several days later, 
the intermittent photic stimulation experiment was carried out in the electroence- 
phalographic recording laboratory. The method reported by Ulett“ was used in 
the latter situation. The subject was seated and a flashed-opal glass screen 30” in 
diameter placed about six inches from his face. Thus the flickering light filled the 
visual field almost entirely. Directions given to the subjects were to report ‘‘any 
sensations experienced other than the flashing off and on of a white light.’’ Fre- 
quency of stimulus was varied over 22 frequencies from 2 to 30 flashes per second 
presented in ascending order. Each frequency was maintained for forty seconds 
followed by forty seconds of rest. An attempt was made to elicit some description 
of the subjective responses to each flicker frequency. All such sensations were re- 
corded verbatim. 


Rorschach protocols were scored according to the system of Hertz. The reports 
of the subjective sensations induced by photic stimulation were scored as similarly 
as possible to Hertz’ method of scoring for the Rorschach and without knowledge 
of the ‘Rorschach results. Criteria for the scoring categories used in evaluation of 
this material are as follows: 


C was defined as any mention of color. It was felt necessary to differen- 
tiate between a response of one color and many — the latter warranting a great- 
er color score — and so a separate C was given for each color mentioned in each 
response. A response of “many colors” was credited with one C. No FC or CF 
categories were used since, in this methodological study, it was deemed imposs- 
ible to differentiate accurately among these three color categories. 

CR the number of different frequencies at which there was some color 
response. 

C’ included any mention of black or white (other than on and off flash of 
the light), for example; shades of gray or black, black venetian blinds, black and 
white design, or streak of lightning. 

F, was the scoring used for any fairly well defined object such as a cross, 
swords, an airplane wing, turtle with a top hat, waterfall, pages of a book, 
honeycomb, fan, or wheel and spokes. 

F, signified insufficiently described forms such as designs, figures and 
objects. 

m included movement of any obiect or organism, for example; moving geo- 
metrical figures or a flock of seagulls flying. No attempt was made to differ- 
entiate between inanimate movement and human or animal movement as the 
incidence of the latter types of response was extremely rare. 
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FK was the scoring used for vista or suggestion of three-dimension 
panse, for example; movement toward or away from self, a tunnel or a view 
an airplane. 

e was defined as surface appearance or diffusion, for example; surfa 
the moon, shimmer, vibration, clouds, blue haze or white blur. 


TasLe 1. Tests or SIGNIFICANCE OF DIFFERENCES IN MEANS FOR DIFFERENT GrovUPs 0 
PuHotic VARIABLES 


t between normal- t between normal- t between anxious 
anxious normal anxious patient normal-anxious patient 
1.26 44 1.40 
44 31 51 
.02 42 29 
.78 05 37 
.03 .26 06 
63 05 2.08* 
37 61 71 
65 25 2.00 
Stereotypy 89 87 53 
No Response 12 1 .20 
Unusual Response 26 4 .96 
df == 147 df = 163 df = 46 


* Significant at .05 


The above categories and criteria were selected after preliminary scoring of th 
first fifty protocols from the photic stimulation experiments, and correlation of the 
major factors with corresponding Rorschach data. For such comparisons we used all 
of the Rorschach scoring categories which appeared in our protocols with sufficient 
frequency to warrant analysis. A!though there was some difficulty in matching the 


two different types of scores, we initially tried all that seemed to have some similar- 
ity. Many comparisons were abandoned early, especially those flicker categories 
in which there were very few responses, or Rorschach categories which were too re- 
fined for the type of reports given during photic stimulation. The present criteria 
are still largely @ priori in origin and could undoubtedly be more precisely related 
to Rorschach factors in further empirical studies. This paper can be considered only 
a first attempt to find the scoring categories which might be comparable to those 
of the Rorschach test. 
RESULTS 

Before ascertaining any relationship between the two sets of variables, a pre- 
liminary comparison of mean normal and anxious photic scores was made to se 
whether there might be a real mean difference on any factor for the groups. Mean 
values for the three groups (normal controls, anxiety-prone controls and patients 
with anxiety) were found and ¢-tests computed. Only one sig nificant mean difference 
was found. The anxious patients rated significantly higher on photic ‘‘m”’ score than 
did the anxiety-prone controls. However, since this t-was varely significant at the 
.05 level of confidence, and 33 t-tests were made. tie aifterence could be expected 
by chance. Moreover, any conclusions based or. the t-tests should be tentative sinc 
the distribution showed marked positive skewing. 

It was then necessary to determine whether those individuals who scored re!a 
tively high or low on any Rorschach variable tended to rate high or low also on the 
corresponding photic variable. For example, does the individual with a relatively 
large Sum C tend to report much color under intermittent photic stimulation? There- 
fore, the median of the total distribution of each Rorschach and photic variable was 
ascertained and every subject was rated “above” or “below” the median on each 
factor. In cases where the median fell within a group of scores at one point on t 
distribution, division was made above or below that point, depending upon wheth:r 
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the median fell toward the upper or lower part of the group in question. Chi-square 

was then used to determine whether there was a trend toward consistency of response 

in the two situations. Column 1 of Table. 2 shows those factors which were compared. 

For the total group, Rorschach Sum C and flicker C were related p = <.05. Ror- 

schach M and flicker movement responses were related p = <.001. 

Taste 2. Tests or SIGNIFICANCE OF RELATIONSHIP BETWEEN KORSCHACH AND INTERMITTENT 
Protic VARIABLES 


Chi-Square Values 


é Anxious Anxious 
Rorschach Flicker ‘ Normals Patients Normals 
Variable Variable ] N=133 N=32 N=41 


=C C 46 5.12* 25 72 
aa 


( CR 2.4 2.28 .65 .00 
I F; 46 50 13 18 
N m 3. OF 36*** 4.09* 94 
Cc’ C’ a . 22 62 .50 
I m 4 13 .O1 54 
I 34 2.03 OS 
I FK &4 7.66** 3.96* 
I F; 3 18 01 
* Significant at .05 level of confidence. 
** Significant at .01 level of confidence. 
*** Significant at .001 level of confidence. 


From the new data it appeared that the patients were contributing to some of 
the relationships differentially from the normals. Consequently the patients were 
separated and the remaining relationship for the entire normal group is summarized 
in column 2 of Table 2. The C and M factors were found statistically to remain 
about equally related to their photic counterparts. 

Separate chi-squares were determined for the patient group as summarized in 
column 3.. The normal group corresponds rather closely to the total as is to be ex- 
pected since this was by far the largest sub-group included in the latter. The patients 
however, show some rather surprising deviations. In this group the relationship of 
Sum C to C apparently disappears and the correspondence between M and photic 
movement is attenuated (p = <.05). A relationship between Rorschach FK and 
photic FK becomes apparent in the patient group (p = <.01). In the patients also, 
the sum of Rorschach F+ responses becomes related to F2 or poorly defined form. 
(Actually, F: probably should have a closer relationship to F+ than should F; since 
more clear-cut illusions would be the opposite of form-boundness. ) 

To further examine these differences between the controls and anxious patients 
it was decided to study those control subjects who had been classified as ‘‘anxiety- 
prone.’”’ As there were only 16 such subjects (a group too small for using chi-square) 
it seemed logical to include from the “normal control’ group 25 additional subjects 
who had received scores of 4 on the 8 point scale for anxiety-proneness. Thus al- 
though they were grouped with the normals in a dichotomous scale they could never- 
theless be considered on the upper fringe of normal as both the psychologist and 
psychiatrist, in their rating, had called subjects rated 4 and 5 “‘borderline.’’ Column 
4 of Table 2 summarizes the findings in this group. It can be seen that, as in the 
anxious patients, the relationship of Sum C (Rorschach) to C (sensations evoked by 
photic stimulation) disappears as does also the relationship between the movement 
categories. A relationship between Rorschach FKand photic FK is apparent but 
less so than in the patient group. 

To find the degree of relationship between those variables which had already 
shown some inter-test consistency, it was decided to use some correlation technique. 
iixtreme positive skewing, especially large on the flicker variables, fostered sus- 
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picion that some spurious variability at the far right might tend to have und 
fluence on any correlations. Consequently, a seven-point coding scale was ad 
which tended to make the distributions more nearly normal. It was impossib\; 
use standard scores for complete normalization as too many scores were piled u; 
the low end. Pearson product-moment correlations were used as presented in Tal 


TABLE 3. CoRRELATIONS BETWEEN CorRRESPONDING RORSCHACH AND FLICKER VARIABLES. N 


Rorschach Flicker Pearson Chi? test 
Variable Variable r 


lineari 
C Bt ry 2 N.S. 
m yy <.6 2 N.S. 
7 “16 

FK 


This table shows three of the factors to be significantly related, although the 
correlations are low, ranging from .16 to .22. M and m still have the highest degre: 
of relationship, Sum C to C a somewhat lower relationship, and here again we find 
Rorschach Sum F'+ to be related to photic F2. Rorschach FK shows no relationshiy 
to flicker FK for the total group. Since slight deviations from linearity had not been 
ruled out, eta’s were determined for Sum C, M, and their corresponding photic fact- 
ors. The chi-square test of linearity indicated that relationships were linear. 


Discussion OF RESULTS 


The hypothesis being tested, namely, that the determinants of perception opera- 
tive in the ambiguous Rorschach situation are related to the determinants of per- 
ception in the similarly ambiguous photic stimulus situation, appears to be born 
out, at least for certain categories. Results would indicate that there is, to som 
extent, perceptual constancy for the individual in different situations. The un- 
measurable differences in the two stimulus fields probably make the obtained sim- 
ilarities of response the more significant. As stated in the presentation of results 
chi-square tests indicated that, in the total group, those who had a tendency to r 
spond in terms of color or to movement categories on the Rorschach demonstrat: 
similar tendency in the photic stimulation experiment. It is also interesting that 
FIX, a presumed measure of anxiety, is related in the two situations for the anxious 
group although not related for the ‘“‘normal”’ controls. 

The findings reported in other studies seem to substantiate the use of ‘‘M’’ as 
factor of a constant degree of importance for the individual in many perceptual situa- 
tions. For example, Klein and Schlesinger“? found that those who s¢ored high i 
Rorschach M also tended to experience apparent movement more easily in an « 
perimental situation. Although there is considerable interest in the “‘C’’ responses 
the reported studies are of a clinical nature and few relate to color theory. | 
fortunately, there is a paucity of experimental work in the literature for the ot! 
Rorschach variables. The lack of positive results, in the present study, for th 
variables indicates that perhaps the other Rorschach categories are not “‘pure”’ 
should be defined in a different manner, or that they represent less stable and fund 
mental aspects of personality. In this connection it is to be recalled that initially t! 
ink blot test was designed as a means for studying perceptual ‘‘types’’ and tl 
Hermann Rorschach was primarily concerned with the interrelationship of mo’ 
ment and color responses. It is to this aspect of the Rorschach test situation that t 
present study has suggested a possible approach. 

It must, of course, be emphasized that since so little is known of what under! 
the apparent predictive ability of the Rorschach, we do not possess methods of « 
termining in an exact manner how comparable are the two methods used here. 

One major difference however between the Rorschach and photic stimulati 
situations may be the subject’s interpretation of the test. The procedure of phot 
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stimulation is not likely to be considered an evaluation of imagination or of any other 
“mental” attribute. It is more likely regarded as part of a physiological procedure 
and hence the subject would not be ‘‘on guard” against revealing anything which 
he knows or fears to be true of himself. Also, the relationship between examiner and 
examinee is minimal (since the subject merely reports what he sees and the examiner 
records) in determining the results of photic stimulation. This makes it less probable 
that significant motivations, defenses, interests, etc., are brought into the situation 
since there is little interpersonal stimulation or threat. 

In this regard, Schachtell > states that interpretation of the Rorschach situa- 
tion is an important factor in determining the subject’s response. The individual’s 
sensitivity to authority ‘“‘may lead to a considerable variety of subjective definitions 
... of the Rorschach situation.”’ If, as is stated by Rorschach, among several other 
types of patients, the anxious are likely to take the test seriously, and, if, as we have 
assumed, the photic stimulation is interpreted as non-threatening by all subjects, we 
might expect to find a greater discrepancy between the flicker and Rorschach scores 
among the anxious than among normals. We find that this is true, and the distribu- 
tion would possibly be even clearer were it not for the variable amount of dysphoria 
that may be induced during the process of photic stimulation. One very noticeable 
difference between normal and patient groups is that despite the significant relation- 
ship in color scores for the normals there is complete absence of relationship for color 
in the two anxious groups. It is plausible that ‘‘C,’’ interpreted as spontaneous emo- 
tional reaction, should be inhibited in a situation interpreted as threatening by the 
subject. 

It is known that intermittent photic stimulation alters the EEG © © ©, 
Since it has been shown “* that some subjects tend to report the most vivid illusions 
at the point where this alteration is most marked, some physiological interpretation 
of the present findings seems justified. It has long been known that brain tumors 
adjacent to the visua! area, and stimulation of exposed cortex of areas 18 and 19 can 
produce the sensation of stars, bright colored lights, etc.“. Also in epileptic sub- 
jects whose abnormally discharging focus may lie in the occipital region, the aura is 
frequently one of visual sensations. Therefore, it is not surprising that such a stim- 
ulus as we have used, which is known to affect the visual cortex ®’, would produce the 
visual experiences which have been described above. Grey Walter“® has suggested 
that, when other than visual sensations occur with photic stimulation, the effect of 
stimulation is spread from the primary receptor area in the visual cortex to other 
regions of the brain. Therefore it seems reasonable to think that the individual’s 
characteristic specificities of response in the two stimulus situations studied here 
may have their derivation from an individual patterning of neural structures. 


SUMMARY AND CONCLUSIONS 


The present study was undertaken to discover whether or not the Rorschach 
categories are meaningful to perception in general, i.e., whether an individual tends 
to experience another ambiguous situation similarly to that of Rorschach testing. 
To determine this, 181 subjects were given the Rorschach test and were later sub- 
jected to intermittent photic stimulation. The data from the second ambiguous sit- 
uation were scored in a manner comparable to Rorschach categories. The con- 
clusions derived were: 

1. Inthe control population, the quantity of color and movement as scored on 
the Rorschach test responses was found to be related to the amount of color and 
movement produced in the subjective sensations resulting from intermitent photic 
stimulation. 

2. The above relationship is disturbed in patients with anxiety and in control 
subjects who were considered anxiety-prone. 

3. Among patients with anxiety and anxiety-prone control subjects, the 
amount of FK response on the Rorschach was found to be related to a similar per- 
ceptive reaction induced by intermittent photic stimulation. 
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4. The finding of a relationship between photically induced sensations 
torschach variables may well lend some hope for the validation of a person 
theory based on perceptual types as proposed from Rorschach’s early experime: 
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LEVELS OF DATA IN RORSCHACH INTERPRETATION 
CALDWELL, BETTYE MCD., ULETT, GEORGE A., MENSH, IVAN N., AND GRANICK, SAMUEI 


Washington University School of Medicine 


INTRODUCTION 


Although widely accepted as a useful clinical tool, the Rorschach technique is 
not firmly established as a scientific method of personality assessment. Correct 
interpretation by means of this technique depends on the validity of many different 
hypotheses, all of which can ultimately be reduced to some type of numerical data, 
including as they do subtle observations of behavior which may contribute heavily 
to the avcuracy of interpretation. This reminder has been frequently given clinicians 
in an attempt to sober the claims made for the instrument. Devotees of the method 
have, however, been loathe to consider it a criticism, and one has stated that in 
using the procedure “*. . . insight and intuition must be given a free hand.” “. P. 5% 

However brilliant and penetrating are interpretations made from artistic use 
of the method, it would seem necessary to continue attempts to remove insight and 
intuition as much as possible and replace them with scientific objectivity. On 
method of accomplishing this is to establish more firmly the relevance and useful- 
ness of the quantitative categories into which the responses may be classified or to 
determine the extent to which behavioral cues contribute to personality analyses. 

In an earlier publication Ulett recognized the need for such an effort and pro 
posed the following approach: 

“The role played in interpretation by factors other than the amount and interrelation- 
ship of scoring categories has not been clearly elucidated. . . A useful approach to this prob- 
lem might be to separate, in the Rorschach interpretative report, those interpretations based 
upon the summary of scoring data alone from the form and content of the subject's replies 
and to include as a separate report observations of behavior made during the test admin- 
istration.”’ 2, p. 548) 

In the present study these various approaches have been conceptualized by the 
authors as representing three levels of data which are available to the clinician who 
uses the Rorschach technique. At one level there is the subject’s test behavior in its 
complex, including the specific content of the responses and the scoring symbols 
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into which they may be translated. At another level, the interpreter may have 
access to only a transcript of the responses and must attempt to understand these 
without having actual face to face contact with the subject. At the final level only 
the scoring summary or psychogram remains, and interpretations must be made 
without benefit of either the behavioral data or the actual content of the responses. 

To establish how much a reduction in level of information reduces the efficiency 
of interpretation of Rorschach data is important both for the practicing clinician 
and for the person interested in clarifying Rorschach methodology. Not only would 
this help to determine to what extent the clinician is basing his interpretation on 
knowledge of the correlates of certain types of gross behavior or on his general in- 
tuitive ability, but it would also shed some light on the meaningfulness of the various 
scoring categories and upon whether they provide efficient substitutes for the rich, 
qualitative material present in the subject’s behavior and in the language and 
content of his responses. 

DersIGN 
The levels to be investigated were defined as follows: 
Level J. The interpreter administers the Rorschach technique to a subject, observes the 
subject’s reacticn to the procedure, makes note of any incidental verbalizations, feelings, 
ete. 
Level IT. The interpreter has access to a copy of the complete test protocol, which he may 
or may not then convert into scoring categories depending on his own personal preference of 
interpretive procedure. 

Level III. The interpreter is given a copy of the scoring summary only, including the 

number of responses falling in different location, determinant, and content categories, average 
reaction time for the achromatic and chromatic cards, and number of popular and original 
responses. 

The study was designed so as to permit measurement of the effects of the various 
levels in two ways. The first procedure consisted of comparing the’ consistency of 
ratings made by three different interpreters all having access to the same level of 
data with those made when the levels were varied. This necessitated the use of an 
experimental group in which levels were varied and a control group where they were 
held constant. Subjects for the experimental group were 24 unclassified psychiatric 
patients being studied in a psychiatric bospital which admits patients whose dis- 
orders range from minimal to severe. The control group consisted of 10 comparable 
subjects whose completely recorded protocols were drawn at random from the psy- 
chological test files. None of these 10 subjects had been examined by the three 
psychologists serving as judges in the present experiment. The raters were all 
trained psychologists with a minimum of three years of clinical experience. 

Table 1 describes the program of testing of the experimental group, in which an 
effort was made to randomize the influence of the individual raters by having each 
rater see 8 subjects at Level I, 8 at Level II, and 8 at Level III. 

TaBie 1. Ovuriine or TesTInNG Program Or EXPERIMENTAL Group (N = 24) 
PERMITTING SysTEMATIC VARIATION OF LEVELS OF Data 


Level I Level II Level III 


Psychologist A 
Psychologist B 
Psychologist C 


The second procedure for determining the effects of the levels was to compare 
interpretations made on the same subject by raters having different levels of data 
with some outside criterion. That is, would an interpretaticn based on Level I 
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agree more closely with the criterion than an interpretation based upon Level 
The criterion chosen was that of psychiatric ratings made on the 24 subjects of 
experimental group. In each case the psychiatric rating was made by a psychiat 
who had not only interviewed the subject for at least an hour but had also conferred 
with the referring doctor and the resident physician who had observed the subject's 
ward behavior. 

Figure 1 presents an abbreviated copy of the rating form which was designed 
for the study and which was used with both the experimental and control groups. 


I. INTELLECTUAL FUNCTIONING 
1. Intellectual level ~70 2 3 110+ 
2. Estimate of intellectual potential -70 2 3 110+ 
3. Characteristic approach to problems and experience— 
quality of thought processes hypercritical 2 3. uncritical 
Range of adjustive behavior suggestible 2 3 rigid 
Ability to generalize lacking 2 3° excessive 
Respect for reality lacking 2 3. excessive 
Range of interests stereotyped 2 3 broad 
Unusual material predominant 2 3 none 
CONATIVE FUNCTIONING 
9. Activity level (amount of activity) hypoactive 2 3. byperactive 
10. Rate of ideas (flow of speech and 
thought) inhibition 2 3. flight of ideas 
Amount of wish activity (creative 
imagination, fantasy) lacking 2 3. excessive 
‘FECTIVE FUNCTIONING 
Emotional control none 
Euphoria none 
Dependency none 
Passivity none 
Aggressiveness none 
Hostility none 
Depression none 
Negativism none 
Anxiety none 


NTER-FUNCTIONAL RELATIONSHIPS 
Degree of insight absent 0 2 3 maximal 
Severity of illness mild 0 2 3 extreme 
Duration of illness brief 0 2 3. extensive 
Level of maturity immature 0 2 3 mature 
Interpersonal relationships poor 0 2 3. excellent 


rigid 

extreme 
extreme 
extreme 
extreme 
extreme 
extreme 
extreme 
extreme 


a 


Ficure 1. Ratina Form Usep ror Eva.vuation or Aut Supsects ON THE Basis or RorscHas 
Data AND Psycuiatrric INTERVIEW 


In designing the form, the authors attempted to fulfill two criteria. In the first place, 
the technique of assessment should not depend only upon individual categories and 
thus be subject to the criticism that the scoring categories had been used in a piece- 
meal fashion and that the total gestalt had been violated. Therefore, at every lev: 
the entire amount of data available could be utilized for making a rating on ever 
item. Secondly, it was the intention of the authors to include personality charact: 
istics which would be considered relevant to a psychiatrist as well as to a Rorscha 
expert. Accordingly every effort was made to avoid loading the scales with choic 
bits of Rorschachiana (such as degree of coarctation, response to inner prompting 
etc.) which often resist translation into practical terms. 

On the original rating form, too lengthy to permit inclusion here, the 25 v 
iables were fully defined for the raters. A four-point scale (zero to three) was chos 
in order to minimize any tendency to give average ratings for an item about whi 
the rater felt insecure. Direction of what might be considered a “good” rating v 
deliberately varied, with optimity sometimes at zero, sometimes at three, and son 
times at a point in between. 
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RESULTS 


Total agreement among raters as influenced by levels. The first analysis made was 
a comparison of the amount of agreement found when three raters were all judging 
the same material (control group) and when level of Rorschach data was varied 
(experimental group). The agreement was indicated by a score derived from a sys- 
tem of weighting which ranged from 6 when the agreement was perfect to 1 when 
there wes maximum disagreement. Such a weighted score wes used in preference 
to the raw scores, since mean raw scores computed from the three individual ratings 
could be misleading as to the actual amount of agreement present (e.g., ratings of 
2,2, 1 on a given item would receive the same mean score as ratings of 3, 2, 0, whereas 
the latter actually show less total conformity). No correction for chance agreement 
was made as this should operate equally in both groups. Table 2 presents the re- 


TABLE 2. COMPARISON OF EXPERIMENTAL AND ConTROL Groups ON ToTAL AGREEMENT 
ScorREs. 





Mean 
Agreement Sigma Difference 
Group Score 


Control 4.64 46 


Experimental 2- 4.56 39 .08 


sults of an analysis of all the items on the rating scale for all the subjects of the ex- 
perimental and control groups. The average agreement score in the control group 


in which all raters possessed the same level of data was 4.64. In the experimental 
group, where level of data was systematically varied, the agreement score was 
4.56. The difference between these scores is statistically nonsignificant, and any 
greater variability in the ratings made when different levels of data were available 
can be attributed to chance fluctuation. 

An analysis was made of the difference in agreement scores between the two 
groups for each individual item. The ¢-ratios for this analysis ranged from .03 to 
2.04, with only the last figure attaining significance at the .05 level. When so many 
analyses are made (25), at least one difference of significance at this level would 
have been predicted on the basis of chance alone. This one difference occurred on 
Item 7, Activity Level, and was in favor of the experimental group. That is, there 
was greater agreement in the experimental procedure where levels were varied than 
there was when all raters possessed the same amount of information. There is no 
need to attempt to explain this logically, however, as this one difference would repre- 
sent no more than chance expectation. This absence of significant differences in the 
individual item analyses was somewhat surprising, as on an @ priori basis the authors 
had hypothesized that for certain items the content of the responses would be vital 
for interpretative purposes and that on these items ratings made at Level III would 
be hazardous. 


Agreement with criterion as influenced by levels. The unit of measurement em- 
ployed on this analysis was the absolute discrepancy between the ratings made by 
the psychiatrist (henceforth referred to as Criterion or C-Level) and ratings made at 
each of the three Rorschach levels. This means that the discrepancy scores could 
range from zero to 3 points on each item in the rating form. If at Level II the ratings 
became less reliable than the corresponding ratings made at Level I, then the differ- 
ence between the discrepancy scores for Level C and Level I and the discrepancy 
between Level C and Level II should be significantly large. Exactly the same pro- 
cedure could be repeated for Level III. In Table 3 the amount of total and partial 
agreement on all subjects in the experimental group is presented in percentages. 
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TasB_Le 3. Percent oF AGREEMENT BETWEEN CRITERION LEVEL AND RatiInGs MaDe a1 
DIFFERENT Rorscuacn LEVE 


Complete 1-point 2-point 3-point 
agreement discrepancy discrepancy discrepan« 


Criterion and 


Level I 


Criterion and 


Level II 


Criterion and 


Level III 


Table 4 presents the results of the analysis in terms of discrepancy scores. 
TABLE 4. CoMPARISON OF DiscREPANCY BETWEEN Ratincs MADE AT DIFFERENT 


RorscHacH LEVELS AND OvutsIDE CRITERION (LEVEL C) on 25 ITEMs For 24 
SUBJECTs. 


’- 1) (C - Il) (C - Il) 


Mean 9.7§ 20.83 
Sigma f 6.03 


t 1.22* 


* t-ratio of difference between (C-I) and (C-IT). 
** t-ratio of difference between (C-I) and C-III). 


From the table it is apparent that although there is no significant difference, there 
is a Suggestive trend toward greater discrepancy as level of data is reduced, with the 
difference between the Level C-Level I discrepancy and the Level C-Level III dis- 
crepancy just escaping significance at the .05 level. 

The individual items were also analyzed according to this procedure. In com- 
paring Levels I and II with Level C, there is only one significant difference (¢ = 2.35 
on the 25 items. This occurred on Item 4, Respect for Reality, on which Level | 
ratings agreed significantly more with the ratings made by the psychiatrist. In 
comparing Level II] with Level I in terms of criterion similarity, however, three 
differences significant at the .05 level emerged. These occurred on Item I, Intellect- 
ual Level (t = 2.23), Item 2, Estimate of Intellectual Potential (¢ = 2.62), and Item 
17, Negativism (¢ = 2.61). On the first two the ratings made at Level I were sig- 
nificantly closer to the criterion, whereas Level III agreed more closely thari did 
Level I with the psychiatrist’s ratings on Negativism. 


DIscUSSION 


From the above results it is apparent that the interpretative rating made on 
the basis of the Rorschach technique is not greatly affected by varying the level of 
data given the interpreter. However, it is also apparent that there is a slight tend 
ency for ratings to coincide less closely with a criterion (psychiatric rating) whe: 
neither observation of the patient during test administration nor knowledge of th 
content of his responses is possible. An analysis of the individual items on which thi 
is most likely to occur gives relatively few clues as to just what might account fo 
this. For example, ratings on Functioning Intellectual Level and on Intellectua! 
Potential were the two items where Level III deviated more widely from the crite 
ion than did Level I. Yet this is an area where psychiatrists generally rely upo 
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psychological measures to validate their clinical judgment rather than vice versa. 
With respect to Negativism the ratings based on the scoring summary only (Level 
III) agreed more closely with the criterion than did those based upon observation 
of the subject in addition to the full protocol. This was particularly interesting in 
view of a marginal comment made by one of the raters that when rating Negativism 
from Level III he felt that he was doing little more than guessing. 

Although one may attempt to explain the pattern of results on the basis of the 
items used in the rating form, this does not seem to be adequate. That is, many of 
the items are difficult to rate on Rorschach data alone. However, in the construction 
of the form, the authors kept constantly in mind the fact that ratings would be made 
on the basis of both psychiatric interviews and Rorschach data. Furthermore, if 
this were really an important factor, more differences should have appeared on the 
individual item analyses on those items where the Rorschach is traditionally con- 
sidered more appropriate. In short, one must conclude that there is at best a slight 
reduction in sensitivity in using the Rorschach technique as one decreases the amount 
of data available for interpretation. This may either indicate that the scoring cate- 
gories are reasonably valid and do succeed in quantifying the more relevant features 
of the test data, or else that rating on the basis of even the maximum amount of 
data is itself too unreliable to permit such a comparison. The use of a dual method 
of analysis which measures both amount of agreement and similarity to a criterion 
suggests that the former conclusion is more warranted. 


SUMMARY 


An attempt to investigate efficiency of interpretation of Rorschach data based 
on the subject’s behavior in its complex (Level I), the response protocol (Level II), 
or the scoring summary only (Level III) revealed little loss in sensitivity of the 


instrument as level of data was reduced. This was true whether the method of 
analysis was comparison of the total agreement present when all raters had the same. 
level of information and when they had different levels, or whether similarity to 
an outside criterion was measured. However, there is a slight and suggestive tend- 
ency for ratings based on the maximum amount of information to agree more closely 
with psychiatric ratings than those based on scoring summaries alone. The results 
ind’ ‘hat the quantitative categories into which Rorschach responses may be 
tr represent meaningful behavioral attributes and that, with respect to 
j siaered relevant for this study, little information is sacrificed by relying 
u these quantitative data. 
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PROBLEM 


The chief objectives of this study have been (a) to devise an experimental de- 
sign and develop a system of dynamic concepts by which we could differentiate 
Rorschach results obtained by analytic from those obtained by non-analytic ther- 
apies; (b) to discover whether there are differences in degree and /or depth of change 
in analytic and non-analytic therapies; and finally (c) to determine, by way of con- 
trol, whether changes in subjects needing therapy could occur without therapy 
and, if so, in what respects and to what extent. 


METHOD 


Psychotherapy, for our purposes, was defined as a professional relationship 
involving regular sessions between an individual motivated to seek help with his 
emotional problems and a clinical psychologist or psychiatrist following a recog- 
nized system of therapy. The psychoanalytic type of therapy was defined as the 
technique attempting to deal with fantasy material through free association. Non- 
analytic psychotherapy (of the supportive type) was defined as a technique which 
does not attempt to probe into the deep fantasies of the patient, and relies on the 
patient consciously to supply material for interpretation. 

Forty two subjects, aged nineteen to thirty five, were employed in this study. 
The mean educational achievement of the analytic group was fourteen years. The 
mean educational achievement of the non-analytic group was thirteen years. The 
modal achievement of both groups was fourteen years. Nineteen of the subjects (14 
male, 5 female) were in therapy with one of the co-authors (TJD) as psychoanalytic 
patients. Fourteen subjects (9 male, 5 female) were treated by two of the co-authors 
(MHK and TJD) by non-analytic techniques. Nine subjects (5 male, 4 female), 
though diagnosed as in need of psychotherapy, failed to obtain it, but returned for 
Rorschach retest at a later date. These untreated cases, serving as controls had 
certain characteristics in common: (1) They manifested clinical anxiety at the time 
of testing; (2) their tesis indicated need for therapy; and (3) a subsequent crisis, or 
series of crises, outside the framework of therapy as here defined, occurred in each 
case. The general composition of the three test groups (analytic, non-analytic, and 
untreated) was well balanced from the point of view of tierapeutic need. The 
neuroses made up 70 to 85°; of each of the groups, while pre- and post- psychotics 
constituted the remainder. A state of balance existed likewise in regard to the aver- 
age periods elapsing between test and retest. The analytic cases averaged 12 months 
in therapy, and totaled an average of 150 hours during that period. The non-analytic 
cases spent an average of 13 months in therapy, totaling an average of 40 hours. In 
the non-treatment cases an average period of 24 months elapsed between Rorschach 
est and retest. It was established that such events as marriage, divorce, separation, 
failure at school, leaving home, sex experiences, etc., occurred in each case, moti- 
vating the subject to re-apply for service. 


Test PROCEDURE 
It has been the custom among Rorschachians to speak in terms of a molecular 


as against an atomistic type of protocol interpretation. The former indeed h 
served as the basis for most clinical report». Yet, the current literary output 
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Rorschach investigators, and especially some recent statistical studies popular 
among non-clinicians, show a tendency to revert to the segmental or atomistic type 
of analysis. The present study is an attempt to report on Rorschach findings in 
terms which experienced Rorschach clinicians have actually used for some time, but 
which in the literature have not received proper emphasis. To be specific, clinical 
experience with the Rorschach has demonstrated that the qualities of personality 
reflected in the test do not inhere in any one scoring category. For one thing, certain 
scoring categories may not appear in a particular protocol. Furthermore, the sig- 
nificance of a particular category as related to other categories necessarily varies 
with each case. For this reason, and to make global evaluation of Rorschach material 
not only more defensible but actual, we developed a system of dynamic concepts 
which are of especial interest clinically, and applied this system to each of the eighty 
four protocols with which we were concerned. The system of concepts, and the 
scoring items subsumed, appear in Table 1. 


TasBLe 1. A Scuema For C.LAssiFyInc RorscHacH MaTeEertAs* 


ANXIETY 6. INTEGRATION 


Diffuse shading Rs 

RT 

Card-turning 

Incidental remarks (especially 


t 
FY 
W% 
Re% 


requests for evaluation, etc.) Shading shock 
Evasive Rs Color shock 
Dd% Kite. 
(H + A) : (Ad + Ad) 
Shading shock 7. APPROACH 
Ete. M:C 
(Fm + m): Ch 
2. LIBIDO Re% 
Color Rs WwW, %D, %d, %Dd/S 
Symbolic content of Rs O% 
(especially cards II, III, & VI) Number P 
Nature of movement Rs Ete. 
Ete. 
8. REALITY TESTING 
HOSTILITY F+% 
m Fix 
Cc k 
5 M:C 
Sy mbolic content of Rs F% 
Nature of movement Rs FM% 
Cw, € Ete. 
Kite. 
9. PATHOGNOMONIC SIGNS 
GUILT Midline references 
Ch distribution Contaminations 
Symbolic content of Rs Perseverations 
Ch:C ratio Confabulations 
Ete. Positionals 
Disembodiments 
Bizarre associations 


5. INTERPERSONAL RELATIONS 
H% stereotypy 
Number and kind of M Ete. 
Number P 
Ete. 10. 


PRODUCTIVITY 
Number R 
tc % 
W:M 
O% 


*The particulars subsumed under the ten concepts in this schema are in themselves not new, and 
are meant to be suggestive rather than exhaustive. The system of concepts, and the range of items 
grouped under them, constitute what is new in this presentation. Each competent Rorschach worker, 
drawing on the literature and his own clinical experience, could add or detract from the list of particu- 


ars. 
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Essentially, our procedure was as follows: The subjects employed in this st 
presented themselves for diagnostic interview in the office of a psychiatrist or a ¢ 
cal psychologist, and were at once referred for Rorschach testing. The subjects y 
re-tested (as previously indicated, 12 to 13 months later) by the same person in ¢ 
case, and the two protocols obtained were evaluated on the basis of the concept 
the system employed. 

In terms of our experimentai design, each pair of Rorschach protocols 
evaluated by one of the co-authors who had had no previous familiarity with the 
cases, and was not familiar with the type of therapy employed. This individual 
judged the protocol as ‘‘changed”’ or “unchanged”’ in each concept area. His judg- 
ments were then matched agaist the judgments of another co-author who similarly 
evaluated a sample of the protocols. A high order of consistency was achieved in 
spite of the fact that, in global analysis as we have used it, precise weighting of each 
scoring entity is not possible. 


{ESULTS 
The test-retest material for each of the forty-two subjects was grouped accord- 


ing to presence and direction of change. The chi square test, with correction for 
small samples, was applied as a measure of significance (Table 2.) 


TaBLE 2. X? anp P VALvUEs 


Analytic Cases Non-Analytic Cases Untreated Cases 


RORSCHACH 
SCORING 
CLASSIFICATION € 4 : X! P 
ANXIETY 
LIBIDO 
HOSTILITY 
GUILT 
INTERPERSONAL 
RELATIONS 
INTEGRATION 
APPROACH 
REALITY TESTING 
PRODUCTIVITY 
PATHOGNOMONIC 
SIGNS 2.0 


Degree of freedom in all cases equals 1. 
Correction for small samples equals 0.5. 


The analytic cases showed significant change, at the one percent level, in the 
concept-areas of Libido, Hostility, Guilt, Interpersonal Relations, Integration, and 
Approach. The non-analytic group showed change, at the one per cent level of cv1 
fidence, in the concept-areas of Libido and Interpersonal Relations. In the untreated 
(control) groups, we found significant changes at the one per cent level of confiden 
in Approach and Productivity. 

The statistical significance of these findings underscores the importance a! 
reliability of the several findings in each of the three groups. The fact that statisti 
significance is lacking in four area-concepts (Anxiety, Reality Testing, Pathogno: 
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ic Signs, and Productivity) in the analytic group; in eight area-concepts (Anx- 
Hostility, Guilt, Integration, Approach, Reality Testing, Pathognomonic 
‘igns, and Productivity) of the non-analytic group; and in eight area concepts 
(Anxiety, Libido, Hostility, Guilt, Interpersonal Relations, Integration, and Reality 
Testing) of the untreated group, does not mean, of course, that some of the individ- 
uals in each of the three groups did not change considerably in any of the remaining 
areas. It simply means that the changes that did occur in other areas could not 
reasonably be expected to occur beyond the range of chance; and hence could not 
be said to offer grounds for accurate prediction. 

From the results obtained, we can see that either of the two types of therapy 
can apparently effect statistically significant changes in such areas as Libidinal 
Organization and Interpersonal Relations.This may mean that these two categories 
are interdependent; or that change in the one implies change in the other. It may 
even be that they can be plotted on a single continuum, and thus represent various 
degrees of one basic quality. 

Among the concepts that have proved significant for the analytically treated 
group, we find Hostility and Guilt. It is of interest to note that another concept, 
Integration, is alsu found significant for the same group. The hypothesis is advanced 
that, possibly, change in Integration is a function of change in Hostility and Guilt; 
and hence these three categories tend, for that reason, to occur together with equal 
frequency. 

The category of Approach does not lend itself to a similar hypothesis, since it 
does not seem to depend on therapy at all. It will be recalled that it was found with 
significant frequency in both the analytically treated group and in the totally un- 
treated group. The conclusion this suggests is that. change in approach is independent 
of change in other areas included in the system. For this reason it is highly probable 
that the area of Approach, as well as the area of Productivity, are the only areas 
that changed without therapy in those subjects with serious emotional problems. 
This does not mean of course that personalities cannot be modified in other ways, 
but that they do seem to be modified more consistently, and perhaps more easily, in 
these areas than in any of the others we employed. 

It is of some interest to note that Libidinal Organization and Interpersonal Re- 
lations were the only areas showing significant change in non-analytic subjects. This 
may mean that the findings of this study are essentially a function of the intensity 
of therapy, rather than of differences in technique employed. Since the non-analytic 
cases received (in terms of hours) less intensive treatment, it may be that shifts in 
these two areas occur—in a way discernible in Rorschach data—before other areas 
begin to shift. This need not mean that there is not a continuous change in libidinal 
organization beyond a certain point in therapy. It does mean that Rorschach be- 
gins to show it at a certain point. What is most interesting, of course, is that Lib- 
idinal Organization tends to change at all without so-called ‘‘deep”’ interpretation. 

Comment should be made on the categories of Anxiety, Reality Testing, and 
Pathognomonic Signs which did not show statistically significant change in any of 
the three experimental groups. Reality Testing and Pathognomonic Signs are taken 
to refer to degree of perceptual distortion. In dynamic personality theory, it is held 
that perceptual distortion is traceable to an anxiety disturbing the Ego. Further- 
more, anxiety in the personality (conceived in a total, quantitative sense) is an index 
of unresolved conflicts. It appears natural then for the category of Anxiety, as we 
dealt with it, to be last to show change in the ongoing therapeutic process, since the 
uncovering of deeper material always brings with it fresh onslaughts of anxiety upon 
the Ego. At the point where personality equilibrium is finally achieved, the total 
amount of manifest anxiety may be expected to be radically decreased, and the 
reality picture to become clearer to the subject at the same time. It is at this point, 
probably, that anxiety nuances become discernible on the Rorschach, and it is at 
this point, also, that reality testing becomes sufficiently intense to reduce patholog- 
ical signs in proportion. 
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The system of analysis employed was limited entirely, of course, to Rorschach 
signs. It is another problem to understand the process, and interpret the signi- 
ficance, of registering specific variables in Rorschach responses. Other changes may 
be seen clinically and through other tests. Furthermore, representation of change 
reflected in the Rorschach may not necessarily be present clinically. Clarification 
of these issues will eventually enrich our understanding not only of clinical processes 
but of personality generally. The findings of this paper do show, however, that the 

torschach technique reliably distinguishes personality changes in three groups of 
subjects, through the use of molecular, rather than atomistic, interpretations in 
areas meaningful to the therapist. 
CONCLUSIONS 

The experimental design involved the «°° of Rorschach test-retest with nineteen 
subjects undergoing psychoanalytic therapy, fourteen subjects undergoing non- 
analytic (supportive) therapy, and nine untreated subjects who were in need of 
therapy. Regarding the analytic subjects as our experimental group, we used the 
non-analytic and untreated groups as controls. The pathological status of the three 
groups as reflected in diagnosis was approximately equal in percentage terms. The 
average duration of therapy (not therapeutic hours) was almost exactly the same in 
both therapeutic groups; though in our untreated group time elapsing between 
Rorschachs was twice as long as it was in either of the therapeutic groups. 

Our findings indicate that personality changes were more extensive, and prob- 
ably more profound, in the analytic than in our supportive-therapy groups, though 
these changes could be explained in terms of intensity of treatment and the time 
variable (session-hours) involved. The therapeutic-control, or non-analytic, group 
showed less thorough modification of subjects’ personality dynamics, and did not 
show any changes which were not also significantly present in the analytically treat- 


ed group. For the untreated controls, changes were obtained in two areas, only one 
of which was also present in a therapeutic group. In a negative way, this finding 
tends to validate the significance of the changes in our therapeutic groups. No statis- 
tically significant changes were found in three concept-areas included in our system, 
and the explanation offered is that these are coordinate concepts whose modification 
must be expected, if at all, on even deeper levels of therapy than either of our thera- 
peutic groups were afforded in the time involved. 


THE DISCRIMINABILITY OF THE “DRIVE FACTORS” REPRESENTED 
IN THE SZONDI PICTURES 
HELEN M. RICHARDSON 
New Jersey College for Women, Rutgers University 


PROBLEM 


One of the basic Szondi assumptions is that the six photographs of patients in a 
given diagnostic category express a common “‘need-system”’ or “drive factor” which 
is present in exaggerated form in that type of mental disturbance. The significance 
of a subject’s choices is derived from his having discriminated and responded to this 
significant common feature. The individual picture in the category does not matter; 
it is the number of choices from the category that counts. From this, it appears to 
follow that the six pictures belonging to a given category should be perceptibly alike 
with respect to this common character. They should be discriminable on this basis 
from the pictures belonging to other diagnostic categories. Furthermore, this com- 
mon characteristic should be the most response compelling thing about each pictur: 
An attempt to match together the pictures which look as though they expressed th: 
same drive factor should therefore result in matchings that correspond to the di 
agnostic categories. The problem of this study is to find out how far this is actuall) 
the case. 
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SUBJECTS AND MeErHop 

The subjects were four men and six women of high average or superior intelli- 
gence. Four were between 20 and 25 years of age; two in the fifties; the other four 
between 25 and 50. Three were college undergraduates. Three served as subjects 
as a favor to the experimenter; the other seven were paid at the standard rate for 
student non-technical assistants. One of the ten subjects performed the experimental 
series twice. Only forty per cent of her matchings duplicated each other, and both 
of her series have been included in the count of matchings. Thus it is as if there were 
eleven subjects. 

The method was as follows: The eight cards of Set I were laid out on a table in 
front of 0 (the subject), about twelve inches from the table edge. The cards were 
laid out in the arrangement prescribed by Deri“: »- * for giving the Szondi test, so 
that they would be seen in the standard configuration. While laying out Set I, E 
(the experimenter) said: 


“Here is a set of pictures of eight people. Each of these persons may look 
to you as if he had some dominant emotion or attitude toward life or some 
characteristic way of feeling about himself and other people. Or he might look 
as if he were likely to act in a certain way.” 


Set II was then laid out after the standard pattern, but on O’s right and near 
the edge of the table. E continued: 


“Now here is another set of eight pictures. Try to match each of these 
pictures with the one in the first set that shows the same dominant emotion or 
attitude or way of feeling. Arrange the pictures in the second set so that each 
one is in the position corresponding to the corresponding picture in the first set.” 


When O indicated that he had finished, E removed Set II, leaving Set I on the 
table. Set III was then presented. O was told that there would be five sets to match 
with Set I, and that when these were finished Set II would be set up as a standard 
to be matched in turn by each of the other sets. Following this pattern, each of the 
six sets served its turn as standard as well as coming up to be matched against each 
of the other sets. Each set was therefore twice exposed to matching with each other 
set. 

No time limit was set, but E recorded the time as each of the sets was laid out 
as a standard, and when the last matching was completed. While O was working on 
a given set, E recorded the results of the preceding matching. Also E noted down, 
as far as possible, comments or remarks made by O. No explanation of the pictures 
was given by E other than that the same eight dominant emotions or attitudes were 
supposed to be represented in each of the six sets. One O, a professor of psychology, 
knew the nature of the test but was not familiar with the pictures. 

After O had completed the final matching, with Set VI as standard, E continued: 


“Now I want to see whether there is any relation between the kind of 
picture that a person is best at matching and the way he feels about that kind 
of people—whether he likes them or dislikes them or doesn’t care one way or 
the other.” 


With this introduction, E went on with the standard procedure for administering 
the Szondi test @ PP. *), 


RESULTS AND DiscussION 
The results of the matchings were tabulated on a grid with the eight diagnostic 
categories arranged from left to right and from bottom to top after the plan of 
Table 1. Within each category the pictures from I to VI were assigned spaces in 
similar fashion. It was thus possible to see what individual pictures were matched 
with each other, as well as to note the matching by categories. 
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Table 1 shows the frequency with which each category was matched with itself 
or with each other category by all subjects combined. The figures without paren- 
theses are based on the actual tabulations which covered the entire grid. Reading 
the figures across the row labeled Manic, we find the number of times that a manic 
picture as a standard received matchings from the pictures in the eight diagnostic 
categories, including its own. Reading down the column labeled Manic, we find the 
number of times that manic pictures were assigned as matches to pictures in the 
eight categories. The figures in parentheses combine the column with the row to 
show the total number of matchings of manic pictures among the eight categories. 
In the totals, the frequencies for matching of a category with itself have been doubled, 
since they must be counted both as rows and as columns. Counted thus, there are 
660 matches for each category. Chance expectancy for a given picture to be matched 
with a given standard is 12.5 per cent. 

Matchings of manic pictures with manic pictures and of homosexual with 
homosexual occur with almost three times chance frequency. A frequency almost 
double the chance expectancy appears for matching of paranoid with paranoid and 
of sadistic with sadistic. In these four categories, the self-matchings are significant 
at the .01 level. Even at this, however, they seem to fall short of the precision im- 
plied in Szondi test interpretation. In the epileptic, hysteric, catatonic, and depressive 
categories, self-matchings do not deviate significantly from chance expectancy. 

Table 1 is of some interest also for its revelation that there is no category which 
is not at some time matched with each of the other categories. True enough, this 
may be due to the fact that the matching was forced into a set and limited pattern. 
Every wrong match forced at least one other wrong match. The matchings which 
have a frequency of less than half the chance expectancy may be worth noting. 
Matchings of catatonic with homosexual, of manic with hysteric or paranoid, and of 
depressive with sadistic fall into this distinctly low frequency class. 

The total number of same-category or ‘‘correct’’ matchings, for all categories 
combined, amounts to 22 per cent of the total of 5280 matchings. Scores of individ- 
ual subjects ranged from 15 per cent correct to 29 per cent, with the median at 21.5. 
The lowest individual scores do not differ significantly from chance. The time taken 
for the matching ranged from 62 to 137 minutes. Rank-difference correlation be- 
tween score and length of time is —.44. 

More meaningful than the category matchings of Table 1 are matchings be- 
tween individual pictures. In this analysis we draw on the detailed chart of tabula- 
tions, which it is not feasible to set up in type. Our experimental procedure (with 11 
subjects) permitted a maximum of 22 matchings of each individual picture with 
each other picture (except, of course, that pictures within the same set were not sub- 
ject to matching with each other). In Table 2 are listed all the pictures which were 
paired in at least 50 per cent of the opportunities that were offered. The difference 
between. this percentage and the chance expectancy of 12.5 per cent is significant 
at the .01 level. There are 32 such pairings. (In Tables 2 and 3 and in the discussion 
following, the categories are designated by their initial letters, with ‘‘k”’ for catatonic, 
as they are on the back of the Szondi cards.) Ten of the pairings listed in Table 2 
are between same-category pictures: 4 involving matching of an m picture with an 
m picture, 3 of h with h, and one each of p with p, of k with k, and of s with s. The 
other 22 are pairs from different categories. 

In Table 3 are listed all the same-category pairs that were matched by our sub- 
jects with no more than chance expectancy; that is, in not more than 3 out of the 22 
opportunities. 

From Tables 2 and 3 we see that the relatively large number of correct match- 
ings in the manic category which showed up in Table 1 was due chiefly to the re- 
semblance among the m pictures in Sets II, III, and V. The m and k pictures in 
Set VI seem to bear no kinship to the other pictures in their respective diagnostic 
categories. It appears that Szondi subjects who choose the k picture in Set VI may 
be identifying themselves with m drives rather than with k drives. Similarly, choice 





388 HELEN M. RICHARDSON 


Tasie 2. Marcuincs BETwEEN INDIVIDUAL Szonp1 Pictures Wuicn OcccurReD Wir) 
FrReEQquEeNcY AT Least Four Trwes As GREAT AS THE CHANCE EXPECTANCY 








Frequency of 
Matching , Matched Pairs 


Same Category Different Categories 
(10 pairs) 22 pairs) 


KVI mIII 
kVI mil 
mI mV 
mI mI 
mIII mV kVI mV 
KVI sI 
IIT AIV pl pV kIT aI kIT hyl 
mI mII sI sll eVI mI eVI sII 
kI KIM KVI mIV III si ALI sVI_ hyl alll 
mIII sl ell dl sV pIV 
RIL RV RV AVI hI pIV AV sIII elV pli el dVI 
dVI plll pV KIIL kV dll 





TABLE 3. Same-CatTecory Picrurges WHich WERE Not Matcuep Wits Eacu OTHER 


‘ategory Pictures which were not matched together 


h 


el with ell, elV, or eVI; elV with ell, elll, eV, or eVI 
hyI with hyIII 
kVI with KI, kII, KIII, kIV, or kV; KIT with kIT 


plV with pI or pVI 


mVI with mI, mII, mIII, milV, or mV 





of eVI would seem to be more indicative of identification with the drives expressed 


in ml or sII than with eI or elV. kII has more in common with dIII and hyl t 
with AILI or VI, and hyl is more like dIII than like AhyIII. True enough, there : 
number of cases where same-category pictures are judged to show “‘the same do: 
ant emotion, attitude, or way of feeling.”” But more common are instances of | 
frequency judgments of similarity of drive between pictures from different diagn 
groups. 
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HELEN M. RICHARDSON 


The attempt to discover whether there was any relation between an individ 
success in matching within a given category and that subject’s likes and dislike 
the pictures yielded no significant results. However, these expressions of likes 
dislikes provide some data of positive value. The choices reveal wide difference: 
popularity among the individual pictures in a given category as well as among 
eight categories. A crude “preference score” for each picture was derived by tal 
the algebraic sum of all the positive and negative choices received by that pict 
from all subjects. These scores range from + 10 for mII to —8 for eV, kI and hyIII 
Our findings regarding the order of popularity agree closely, though not perfectly, 
with some results reported by Harrower®?. After conversion of Harrower’s data 
into preference scores by the method described above, the correlation (rho) between 
the two sets of scores was found to be .88 

Table 4.shows the preference scores for the six individual pictures in the eight 
diagnostic categories, arranged so as to show the range in popularity of the pictures 
within a category and the difference in total popularity among the categories. If we 
arrange the actual pictures in the order indicated by Table 4, and refer again to 
Tables 2 and 3, we can see that degree of smiling, of appearing friendly, sociable, or 
competent, vs. withdrawn, suspicious, or anxious, may well have been the kind of 
“emotion, attitude, or feeling’ on the basis of which our subjects matched the pix 
tures and also the basis for their preferences. In some cases, resemblance in these 
characteristics occurs between pictures within the same diagnostic category; in 
some cases it occurs between pictures in different categories. 


SUMMARY 


Tests were made to find out to what extent Szondi pictures having the same 
diagnostic classification would be judged to express the same “‘drive factor.’’ Cor- 
respondence appears essential to the assumptions underlying the Szondi test. Ten 


subjects twice matched the eight pictures of each set with those of each other set, 
arranging in corresponding positions pictures that represented‘‘the same dominant 
emotion, attitude, or way of feeling.’’ After the last series of matchings, each sub- 
ject was asked to indicate the pictures liked best and least from each set. 

Chance expectancy of matching any diagnostic category with any category 
(including its own) would be 12.5 per cent. The percentages of correct (same-cate- 
gory) matchings were as follows: h:32; s:22; e:12; hy:16; k:15; p:24; d:16; m:34. 
Four of these are significant at the .01 level; the other four are below the .05 level of 
significance. 

The frequency with which each individual picture was matched with each other 
picture was also noted. There were 32 pairings between individual pictures which 
occurred in 50 per cent or more of the opportunities provided by the experimental! 
procedure (significant at the .01 level). Of these 32 pairings, 10 were between same- 
category pictures; 22 were between pictures from different categories. The k pictur: 
from Set VI was consistently matched with m rather than with k pictures. 

Choices of pictures liked best and liked least disclosed no significant relation 
between factor loading and success in matching of category with category. It ap- 
pears, however, that degree of looking friendly, sociable, competent, vs. withdrawn 
suspicious, or anxious may have been a basis for both the choices and the matchings 
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{ RORSCHACH STUDY OF THE PERSONALITY STRUCTURE OF 
OBESE WOMEN* 


BENJAMIN KOTKOV AND BENJAMIN MURAWSKI 


U.S. Public Health Service Boston VA Mental Hygiene Unit 


INTRODUCTION 


Currently, there is a wave of interest in the problems of the obese individual. 
The literature discloses that a study of the obese personality is important because 
of the high incidence of the disorder™, its serious physical and physiological cor- 
relates“: © and a large number of studies“, * *, 5, which have reported or sus- 
pected a relationship between obesity and. more or less specific personality dimen- 
signs. 

The purpose of the present study is to approach the adult female obese person- 
ality through the use of the Rorschach test. The authors’ plan is (1) to determine 

there are reliable and differentiating Rorschach quantitative indices which char- 
acterize the personality structure of obese women as contrasted with women w ‘hin 
the ideal weight range, (2) to determine the degree in which these differentiating 
indices are present, and (3) to attempt to reconstruct a pattern of psychic-economic 
distribution of personality energy typical of the personality structure of obese wo- 
men. Later papers will treat nuclear conflicts and crucial relationships in the per- 
sonality of obese women as reflected in other projective techniques 


MetTHODS 


The subjects were volunteers in the Boston Experiment for Group Psycho- 


therapy with the obese“ sponsored by the Chronic Disease Division of the U. 8. 
Public Health Service in cooperation with the Massachusetts State Health Depart- 
ment. The obese population was composed of volunteers who responded to news- 
paper publicity and a radio program sponsored by the Massachusetts Society of 
Mental Hygiene. All applicants had to present a physician’s certificate for medical 
clearance. Sevringhaus’®® definition of obesity was used: “‘. . . obesity is an excess 
of fat over the normal expected for height, age, sex, between the ages of 25 and 
30... an allowance of 10°% variation above and below the average is to be made.”’ 
The “‘ideal weights” or controls were volunteers from a School of Public Health, 
Public Health Personnel, a Visiting Nurses Association, Dispensary Personnel, 
Adult Education groups, a YWCA, and a Community House. Idea! weight was de- 
termined from a nomograph distributed by the Metropolitan Life Insurance Com- 
pany. In view of the rather large number of obese women who were tested it was felt 
we could best control sampling errors by dividing our population into a “‘pilot 
group”’ and an “experimental group” wherein the results from the pilot group could 
be checked with a second and larger experimental group and those results identified 
which occurred by chance. This gave us two distinct populations of 51 female sub- 
jects for a pilot study and 80 female subjects for the experimental group. 

The pilot group of 51 sujects were randomly selected from 131 volunteers, and 
from a group of 30 obese and 21 ideal weights we established preliminary hypotheses. 
In evaluating the statistically significant variables, however, the use of random 
selection from the total population tested, rather than matching for age, IQ, educa- 
tion, marital status and career vs. housewives left us in the position of not being 
able to say that obesity, our main concern, was the variable which produced the 
differences observed in our statistics. 


*A paper presented at the American Psychiatric Association Regional Research Conference in 
soston, February 28, 1952. 

1A group technique (7) was used as a more convenient, expedient device. Conditions of adminis- 
tration were equal, hence comparable for our purposes. 
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The difference between the means of age, education, marital status, and ea) 
vs. housewives of the two groups of the pilot study were far beyond chance so t} 
along these dimensions our two groups varied significantly. An inspection of Table | 
shows that the difference in age, education, marital status, and housewives 
career women (nearness to the refrigerator) could account for the results.2 Since o 
design involved splitting the parent population into pilot and experimental groups 
the differences created by random selection were not fatal but required correction 
TasLe 1, CHARaAcTERIsTICS OF PrLor Group. 





Obese N =30 Ideal N =21 
Factors Mean Sigma Range Prob. Value Mean Sigma Rang 
Age 42.82 9.87 24-63 .02 34.24 13.73 20-55 
Ed. 11.63 2.12 8-18 .O1 14.7i 3.98 12-20 
I. Q. 115.64 24.69 96-137 .40 122.00 28.84 105-135 
16 married (53%) <.001 2married (10%) 
M. 8. l4 single (47%) 19 single (90%) 
15 career (50%) <.001 | 2leareer (100%) 
Occupation | 15 housewives (50%) 0 housewives (0%) 
TABLE 2. CHARACTERISTICS OF EXPERIMENTAL GROUP. | 
Factor Obese N =46 Ideal N =34 
Mean Sigma Range Prob. Value Mean Sigma Rang 
Age 34.41 7.89 20-50 .50 35.44 9.30 19-50 
Ed. 13.15 1.90 11-18 .50 13.38 2.09 9-17 
I. Q. 118.88 32.43 92-140 .50 119.65 13.52 95-134 
M. 8. 27 married (58%) .70—.80 22 married (65%) 
19 single (42%) 12 single (35%) 
Occupation | 22 career (48%) .70—.80 14 career (41%) 
24 housewives (52% 20 housewives (59%) 


The probability values of the means in Table 2 demonstrate clearly that the 


loose ends of random selection were corrected by a close matching of our subjects 
in the experimental group. This increases greatly the probability that any differenc: 
we can demonstrate on the Rorschach can be attributed to weight for the oth: 

controllable dimensions have been equated. 

The pilot sample was subjected to extensive statistical scrutiny using chi-squa 
and t-tests. The latter was employed for those variables involving percentag: 
where the mean percent represented the most efficacious manner of dealing wit! 
the Rorschach variable. Seventeen general Rorschach formal and content categori 


*The authors wish to acknowledge the statistical advice of Professor George Miller of the Mass 
) — g 
chusetts Institute of Technology. 
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were subjected to chi-square analysis, but by using a refined breakdown a total of 
{6 variables were tested. The range was from ‘“‘number of responses per card,”’ to 


“consistency of M:Sum C, FM:Fe, R to last three cards,” to “‘type of W response.”’ 


Eleven t-tests were run: W%, D%, F%, expanded F°%, F plus ©, A%, % to last 
three cards, R total, 1Q, Age and Education. The probability value of .20 level of 
confidence was utilized as a cut off point in the pilot group for the construction of 
hypotheses. As the direction of the difference between the obese and the ideal 
weights was to be predicted from the hypotheses, a one-sided test of significance was 
employed. This allowed us to posit trends of .10 to .06 in our experimental group. 


RESULTS 

Fifty-seven empirical hypotheses were considered. Chi-square comparisons 
yielded nine specific Rorschach hypotheses for obesity: M less than two, greater sum 
© than M, no recreation content, less H, more CF than FC, no botany, no clothing, 
no lamp responses, and more F— responses. The ¢-tests contributed five additional 
hypotheses: W% less than 36.46, D% more than 56.17, F°> more than 60.83, ex- 
panded F% more than 62.83, and A% more than 41.30. 

However, when these fourteen hypotheses, derived from the pilot group, were 
applied to the new population of the experimental group, seven hypotheses emerged 


TaBLe 3. A. CxriI-SQuARE COMPARISONS 











Pilot Group Experimental Group 
(Obese N = 30; Ideal N = 21) (Obese N = 46; Ideal N = 34) 

Rorschach Chi-square Degree of Level of Chi-Square Degree of Level of 
Factors values freedom confidence values freedom re — 
M 10.82 2 01 4.87 2 .04 
M:sum C 8.30 2 .02 1.47 2 12 
Recreation 11.45 1 01 10.15 1 001 
H 7.62 ] .O1 1.78 1 09 
FC:CF 6.86 2 .05 0.53 2 .40 
Botany 2.72 1 .10 1.44 1 12 
Clothing 2.27 1 .10 2.02 1 07 
Lamp 1.69 1 . 20 1.38 ] 02 
F- 1.53 2 .20 4.25 2 06 

‘ B. “t’” Test Comparisons 

wr | Level of Confidence “—" Level of Confidence (144P) 

W% 2.68 01 3.29 001 

D% 2.00 .05 3.60 .001 

F% 1.61 .10 2.17 O15 

Exp. F% 2.74 01 2.85 005 





A% 1.93 .05 1.06 17 
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at the .05 level of signifieance or higher, and three additional hypotheses indica: 
of trends at the .10 level or higher. The following seven Rorschach variables 
vived the double weeding out processes and can be considered to be significa 
different for the two groups: no lamp content, no recreation content, F% more t! 
60.83, expanded F%) more than 62.83, M less than two, W% less then 36.46, | 
more than 56.17. A further inspection of Table 3 disclosed three additional trends 
between .06 and .10 level of significance) : less H, more F—, and no clothing responses 

There were several significant negative results as well. First, the mean number 
of responses was the same. This allowed us to proceed statistically much more freely 
than if we had to equate for the number of responses. Second, the color ratios and 
percentages did not hold up, removing a possible discriminating problem out of the 
area of relationships with external objects. 

Table 4 serves to integrate the discrete variables into a composite picture of 50 
subjects selected at random: 25 obese and 25 ideal weights. The significance of eac! 
variable was previously determined in isolation and since the Rorschach scores ari 
never treated in isolation, but rather interpreted in the light of all other categories, 
the authors hesitated to deal structurally with the 7 variables without some measure 
of the frequency of occurrence in a sampling of both groups. Table 4 shows that F% 
and expanded F% (sum of F, FK, and Fe%) are perfectly correlated for all 50 cases 
Since the retention of both signs would not explain more of the variance, F% is de- 
leted for the purpose of arriving at a statistical index of obesity. An average of 3.7 


Tasie 4. Tue Frequency or Rorscnacn Signs wirn Bopy WEIGcnrT. 
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B. Contrroit Sampite (N 
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signs (out of 6 possible signs) was obtained for the obese, and an average of 1.2 signs 
was obtained for the controls. This is an average of 2.46 for the two groups together. 
Using the combined mean as a cutting point, the 50 cases were divided into those 
having 2 or less versus those having 3 or more signs. The appropriate chi-square 
is very significant (chi-square =23.12, with one degree of freedom, level of confid- 
ence far beyond the .001 level). Hence it can be concluded that the occurrence of 3 
or more of these 6 signs in the record of a female subject is strong Rorschach evidence 
that she has the obese personality structure. 

In order to obtain the most inclusive personality description, an inspection of 
Table 4 was made for consistent patterns. Of the obese, cases 2, 4, 6, 11, 12, 13, 14, 
15, 17, 20, 21, 22, and 25 (13 cases) have 4, 5, or 6 of the 6 signs (no controls have 
more than 3). Cases 4, 12, 13, 14, 17, 20, 21, and 22 (8 cases) have 5 or 6 of the 6 
signs and 6 of these 8 extreme cases show lamp none, expanded F% greater than 
62.93, M less than 2, and D% greater than 56.17. 

Discussion 

An inspection of the Rorschach variables suggested that the obese women 
registered the same number of scorable stimuli entering the perceptual system of 
their personality structure as the women within the ideal weight range (same mean 
number of responses). However, the quality of the production between the two 
groups differed significantly in that the obese women restricted their reactions to 
& more concrete, a more arid type of association and appeared to function in a more 
limited sphere (higher “‘ Detail’ %). 
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’ The obese women have more anxiety than the ideal weights./ Because t! 
achieve less mastery over anxiety, they were held in a greater state of tension (hig 
“Form” %). Although the obese women showed as much emotional rapport ¥ 
others (no statistical difference between the two groups in the color ratios), they 
closed less pleasure in object relationships (fewer gave “lamp’’ responses to “ phal! 
areas, e.g. in cards 6, 9, 1, and 2; also fewer gave “‘recreation”’ content). Intolerance 
of tension and/or acting out did not seem to be a distinguishing factor between the 
two groups of women (absence of “‘color’” ratio predominance). This suggests that 
the_obese women use a special vehicle for coping with tension. 

: [ Repression is deeper in the obese women and hence the unconscious content is 
less available. There is an increase in pain and decrease in worldly pleasure. There 
is a higher degree of energy poured into defense and hence more internalization and 
depressive trends (higher ‘‘Form’”’ %). Energies appear to be directed more to the 
area of internal difficulties, repressed drives and memories (lower number of ‘‘ Human 
Movement”’ responses) than relationship to external objects. The obese women 
have less flexible ego boundaries. Their reservoir of energy is more tightly bound 
and not as readily available for constructive use (lower “expanded Form” °,) 
Hence obese women may be expected to react more strongly to traumatic exper- 
iences. Their powers of ego integration are less adequate. The obese women are less 
constructively self-assertive. They resort more to a passive type of mastery and 
show less operating capacity for achieving active constructive goals. The pattern 
of the Rorschach variables approximates most nearly the psychic-economic structure 
of the compulsion neurotic. | 

— An examination of the “trend’’ results indicated a tendency for the obese 
women to show less interest in people (less ‘‘Human”’ responses), a tendency for dis- 
playing a more blurred sense of reality (more Form-minus responses), and a tendency 


to present a, greater disturbance in body image (more obese did not tend to give 
“clothing” responses). Of the latter, following Schilder™, Weiss” stated, ‘‘'It is 
only when we do not feel at ease in our clothing that we fail to extend to it our bodily 
ego feeling, instead feel awkward wearing it, as if it were something foreign attached 
to us. 


SUMMARY 


Group Rorschach protocols of obese women were contrasted with ideal weight 
women controls to determine those characteristics which statistically held up 
through both pilot and experimental groups of 51 and 80 subjects respectively. 
Those variables which survived the double screening process were then interpretively 
combined. A personality picture of obese women was constructed that highlighted 
the characteristics of obese women and which consistently differentiated them from 
ideal weight women. 
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THE DETECTION OF PARANOID TRENDS BY THE EIGHT CARD 
REDRAWING TEST (8CRT) 
LEOPOLD CALIGOR, PH.D. 


New York University — Bellevue Medical Center 


INTRODUCTION 


Machover ®?,,on the basis of clinical experience asserts that paranoid pathology 
can be detected in figure drawings by (1) “‘paranoid signs”, and by (2) direct inspec- 
tion. Fisher and Fisher®?, however, tested both of these assumptions, using the 
drawings of paranoid schizophrenics, and found that “The majority of these draw- 
ings did not fall under the category of paranoid in terms of the criteria implicit in 
each method. Furthermore, it was found that the two separate methods of evaluat- 
ing paranoid trends did not correlate.’’ The present lack of quantification has been 
a major obstacle in the further development of the drawing technique as a diagnostic 
instrument. 

Tue Test 

The Eight Card Redrawing Test (SCRT) is believed to open an avenue of ap- 
proach to quantification—quantification based upon an increase of interrelated con- 
tent material rather than upon an evaluation of detailed “‘signs”’ 

The 8CRT consists of eight 84% x 11 inch sheets of onionskin stapled in a vertical 
pad-like fashion to a heavy white cardboard base of the same size. A thin sheet of 
white cardboard of somewhat larger dimensions is inserted between the bottom 
onionskin and the cardboard. Seven of the eight onionskins are rolled over the top 
and beneath the cardboard base, with the eighth onionskin resting flat on the white 
cardboard insert.. The instructions for drawing * 1 are: ‘‘Make a picture of a whole 
person.’’ The bottom onionskin is then brought forward from beneath the card- 
board base so that it rests flat on the drawing just completed. This presents the sub- 
ject with a clear.tracing image. The instructions for drawing #2 are: “And now, 
you may add or subtract anything you want to this picture of a person, or you may 
change it, or you may leave it alone. Only, make a picture of a whole person.”’ The 
same procedure and instructions are followed with drawings *3 through 8. Each 
time the white cardboard insert is placed beneath the last production made. This 
prevents the subject from seeing any but the last figure produced, thus making any 
changes in the latest drawing a development of the immediately preceding one. The 
average time to administer the SCRT to adults is about 35 minutes. 

The construction of the 8SCRT permits observation of the dynamics of change 
along a continuum. This continuum is believed to be on an increasingly unconscious 
nature. Graphie aspects which the subject at first feels it is mandatory to include 
often tend to gradually become excluded, whereas other graphic aspects tend to 
make a later appearance and to become increasingly reinforced. The graphic changes 
in the SCRT frequently provide objectively observable data which lend themselves 
to content analysis and contribute to insight into the subject’s personality dynamics 
at increasingly unconscious levels.! 

The goals of the present study are: (1) to compare the ability of the SCRT and 
one drawing to detect paranoid pathology, and (2) to determine whether the SCRT 
can detect paranoid pathology at increasingly unconscious levels. 


MetTHODS 


Twenty male subjects, all diagnosed paranoid-schizophrenic on the basis of 
case history, psychiatric diagnosis and psychological testing, were selected by an 


1. A previous study“) has indicated the increasing ability of later drawings to tap the individual’s 
unconscious conception of his own masculinity-femininity identification. 
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alphabetic random sampling technique at Brooklyn State Hospital and given 
SCRT. Their drawings *1 and #8 were then removed from the series and eac} 
mounted individually. 

Three raters who had worked with figure drawings for at least three years, in- 
cluding hospital experience, were shown the forty drawings. Spontaneous notations 
of the patients were covered up. The raters were instructed: ‘Here are some figure 
drawir made by adult males. Your task will be to score them for the presence or 
absence of strong paranoid trends.’’ Scores were + and —. Each rater scored inde- 
pendently of the other. Where disagreements existed between raters I and II, the 
evaluation of rater III was used to determine a mutual score. 

The twenty 8CRT records were then reassembled to include drawings #1 and 
*8. The same three raters were now carefully informed of the construction of the 
SC RT and instructed: ‘Your task is to score these series of drawings for the presence 
or absence of strong paranoid trends. You may use the criteria of content and 
change in sequence of drawings, breaks in form level, appearance of bizarre specifi- 
cations, “‘signs’’, or any other clinical material made available by the series.’’ The 
scores were again + and —. As before, where scoring difference existed between 
raters I and II, the evaluation of rater III was used to determine a mutual score. 

Results obtained were used to determine: (1) the efficacy of the 8CRT as com- 
pared to that of one drawing in the detection of paranoid pathology by direct clinical 
inspection, and (2) whether drawings * 8 expose paranoid pathology at a more un- 
conscious level than do drawings * 1. 

RESULTS 

Results will be presented in two categories: (1) coarctated schizophrenics (show- 
ing primitive, regressed, inflexible drawing characteristics), numbering 9 subjects; 
2) over-ideational schizophrenics (showing non-regressed, overactive phantasy), 
overflexible characteristics), numbering 11 subjects. 


Interscorer Agreement. The agreement between raters I and II for the presence or 
absence of strong paranoid trends for the total sampling (N = 20) was 70% for 
drawings * 1, 80°; for drawings #8, and 90°) for the 8CRT protocols. The agree- 
ment between raters I and II for the coarctated paranoid schizophrenic sampling 
(N = 9) was 66.7% for drawings *1, 77.8°% for drawings *8, and 77.8% for the 
SCRT. The agreement between raters I and II for the over-ideational paranoid 
schizophrenic sampling (N = 11) was 72.7% for drawings #1, 81.8°% for drawings 
*8, and 100°; for the 8SCRT. 

The consistently higher interscorer agreements for the 8SCRT than for one 
drawing suggests the greater stability of the 8CRT as a clinical tool. This is to be 
expected, since the construction of the 8CLXT exposes more graphic data than does 
a single drawing. This tends to place diagnosis on a scientifically firmer footing and 
minimizes the subjective “hunch’”’ factor frequently observed in the evaluation of 
one drawing. 

The consistently higher interscorer agreement for drawings #8 than for #1 is 
believed due to the last drawing of the SCRT graphically exhibiting more clearly 
what may be present to a lesser extent on the first one. This is a result of the in- 
creasingly unconscious nature of the SCRT which tends to minimize conscious eva- 
sion and to reinforce graphic aspects of which the subject is not aware. The increase 
in interscorer agreement from drawings *1 to #8 to the 8CRT suggests that al- 
though later drawings expose strong paranoid trends to a greater extent than do th 
earlier ones, the SCRT reveals even more data than do drawings *8. The difference 
in increase is believed due primarily to the exposure of inter-related graphic content 
by the SCRT which cannot be detected on one drawing. 

The consistently higher interscorer agreement for the over-ideational as com 
pared with the coarctated group was expected. Their records tended to expose con 
tent, breaks in form level, bizarre specifications, etc., on the later drawings, whereas 
the coarctated group tended to rigidly reproduce the same figure. 
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The Detection of Strong Paranoid Trends. The final allocation of strong paranoid 
trends for drawings * 1, *8 and the SCRT are presented for the coarctated (Table 
la) and overideational paranoid schizophrenic groups (Table 1b) separately. 


TaBLE 1. Frnat ALLOCATION OF StrRoNG PARANOID TRENDs ON Drawtnas No. 1, No. 8 AND THE 
8CRT For a ParaNoip ScnIZzOPHRENIC SAMPLING 


Ia - Coarctated N =9 Ib - Ideational N = 11 


Drawing Drawing Drawing Drawing 
Subject #1 #8 8CRT Subject #1 #8 


+ + 


+% 44.4 \| 66.7 ; + % 9.1 72.7 90.9 


Of the total sampling (N = 20), 25°7 were scored plus on drawings ¥ 1 for the 
presence of strong paranoid trends, 70°) on drawings *8, and 85°) on the 8SCRT. 
Of the coarctated paranoid schizophrenic group (N = 9), 44.49) were scored plus 
on drawings * 1, 66.7°% on drawings *8, and 77.8°% on the 8CRT (Table la). Of 
the over-ideational paranoid schizophrenic group (N = 11), one subject, 9.1%, was 
scored plus on drawing #1; 8 subjects, 72.79%, on drawing #8; and 10 subjects, 
90.9%, on the 8CRT (Table 1b). The only subject scored minus on the 8SCRT was 
scored so by both raters. 

For the total sampling, the 8SCRT was 3.4 times as good as one drawing in its 
ability to detect strong paranoid trends. This was due to the 8CRT having fre- 
quently exposed interrelated content, overideation, severe breaks in form level, 
bizarre specifications, paranoid ‘‘signs’’ such as overemphasis on eyes, etc., not ob- 
servable on drawing * 1. 

In the coarctated group, 4.9 times as many, were correctly diagnosed for strong 
paranoid trends on drawings #1 because of their regressed and constricted graphic 
characteristics. However, the over-ideational protocols for the entire 8CRT per- 
mitted the detection of strong paranoid trends 1.2 times as frequently as did the 
coarctated records for reasons stated above. 

The detection of strong paranoid trends on drawings # 1 of the over-ideational 
paranoid schizophrenics was extremely limited, only one subject of 11 having been 
scored plus. This wus due to their successful graphic evasion, maintenance of good 
form level and the iack of bizarre specifications on one drawing. However, the fre- 
quent reversal of these factors and the exposure of inter-related graphic data on the 
8CRT made for greater acuity in the detection of strong paranoid trends. This ex- 
plains why only one subject in his group (9.1°)) was scored plus on drawing #1, 
whereas 10 subjects (90.9%) were scored plus on the entire 8SCRT. 
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The Detection of Strong Paranoid Trends at Increasingly Unconscious Levels. The in- 
crease in the detection of strong paranoid trends of drawings *8 over #1 was 2.8 
for the total, 1.5 for the coarctated and 8.0 for the ideational sampling. The con- 
sistent increase indicates that later drawings detect strong paranoid trends to a 
greater extent than do the earlier ones. Due to the 8SCRT rationale of the increasing- 
ly unconscious nature of later drawings, these results were anticipated; later draw- 
ings display graphically more of the true personality dynamics and less of the con- 
scious evasion frequently found on drawings *1. This was obviously more so for 
the over-ideational than for the coarctated sampling for reasons previously stated. 
The detection ‘of strong paranoid trends was consistently greater on the 8SCRT 
than on drawings #8. The increase of the SCRT over drawings *8 was 15% for 
the total, 11.1, for the coarctated and 18.2% for the over-ideational sampling. This 
is due to the exposure of interrelated content which no one drawing could detect and 
the tapping of increasingly unconscious levels. 


DISCUSSION 


It must be emphasized that any statistical approach to the 8SCRT does the 
technique injustice as a diagnostic instrument since its major strength is the graphic 
portrayal of personality dynamics. For example, the over-ideational SCRT protocols 
frequently showed evidence of homosexual panic, confusion in masculinity-femin- 
inity identification, confusion of sexual and religious symbols, etc. The presence of 
hostility and paranoid dynamics usually increased on the later drawings. In the co- 
arctated records, once a detail such as an eye was stressed, the emphasis increased 
until it fy equently reached bizarre proportions. The spontaneous verbalizations of 
the raters indicated that where drawing *1 clearly revealed paranoid symptom- 
atology, the 8CRT was still deemed of value for its portrayal of the underlying 
psychodynamics. 


SUMMARY 


The purpose of this paper is to illustrate the eight ecard method Redrawing 
Test. It is hoped that the method of, the SCRT will contribute toward the fuller 
utilization of figure drawings as a diagnostic instrument. Because of the structure 
of the SCRT and its completely non-verbal nature which lowers the possibility of 
evasion, personality dynamics are frequently exposed at increasingly unconscious 
levels—personality dynamics which are graphically presented by the subject rather 
than subjectively divined by the clinician. This has been illustrated by a series 
of 8SCRT in the appendix. 
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THE EFFECTS OF TWO SIMULTANEOUS COGNITIVE AND AFFECTIVE 
STIMULI 


EMANUEL STARER, PH.D. 


V. A. Hospital 
Coatesville, Pennsylvania 


INTRODUCTION 


This investigation is an exploratory attempt to study the effects of mild affective 
stimulation on psychomotor performance. Although the approach of employing 
simultaneous stimuli is not new and had been used in early conditioning experi- 
ments, the setting in which the present technique is employed is apparently an in- 
novation. This particular research had been suggested by Malmo, Shagass, Bel- 
anger, and Smith whose results led them “‘to question the neglect of motor aspects 
of behavior in most studies dealing with personality, and, especially, with its path- 
ological manifestations in psychiatric disorders.” Using a control group of 21 
normals and 76 psychiatric patients, and a technique of employing a standardized 
series of painful stimulations (non-specific) they arrived at the conclusion that under 
this form of non-specific stress, psychoneurotic patients were characterized parti- 
cularly by disturbances in motor function, this being evidenced by impaired motor 
control and excessive muscular tension. Their results, in general, supported the 
hypothesis that psychoneurotic disorders involve defective motor regulatiou mani- 
fested by abnormally increased motor disturbance under any stressful condition. 


METHOD AND PROCEDURE 


The control group consisted of 15 female affiliate student nurses, age ranged 
from 19-24, with the mean age of 20, and functioning on at least an average intelli- 
gence level. No psychiatric deviations were evident clinically. No difficulty in 
hearing was present. 

The experimental group consisted of 15 female psychiatric patients, all diag- 
nosed as psychotic. The age range consisted of 23-39 years, with the mean of 30.7 
years. Three were diagnosed as manic depressive; seven as schizophrenics, paranoid 
type; three as hebephrenic type; one as catatonic type, and one as mixed schizo- 
phrenic type. Two of the patients had improved to the point of being considered 
ready to leave the hospital, and three others displayed considerable improvement in 
their behavior. At the time of testing, all were capable of functioning on at least an 
average intelligence level, as indicated by their performance on the Wechsler- 
Bellevue Intelligence Scale. No difficulty in hearing was present in any of the 
patients. 

The subjects in both the experimental and control groups were given the Bende: 
Gestalt designs to copy on 8% in. by 11 in. white paper. They were all provided 
with two well-sharpened pencils with erasers. The standard instructions were: 
‘Please copy the following designs as well as you can on the paper.” If the subjects 
wanted more paper, they were given it quickly. The time needed for each design 
was noted. 

After the completion of the copying of the Bender Gestalt designs, the subjects 
were given a five minute rest. Then a recording which had been made on a standard 
dictaphone machine was played to each of the subjects. It contained the following 
instructions: 


“Please listen to the following instructions. You are going to be given a serie: 
of designs to copy as well as you can on the paper. While you are copying them, yor 
will hear me speaking to you. Continue copying the designs no matter what you hea: 
Do not stop copying the designs at any time. Are you ready? (Pause of 10 seconds 
All right, now begin copying the design. (Pause five seconds). 
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‘“‘Now you can hear my voice. Remember to continue copying no matter what 
you hear. I am going to ask you a series of questions. Do not try to answer them at 
any time. Just continue copying the designs. (Pause 10 seconds). 

1. Do you feel sick at the present time? 
When you ‘vere a child, were you ever happy? 
Did you sumetimes dislike what your mother did? 
Were your father and mother happy together? 
Did you ever feel that your father was friendly to you and liked you? 
Did you feel close to your mother? 
Did you ever feel jealous of your brothers or sisters or your parents? 
Did you make a success in your job? 
Did you ever feel like you wanted to kill yourself? 
Do you ever get very angry and annoyed with your boy friend or husband? 
11. Do you enjoy intercourse or sex activities? 
12. Are you happy with your boy friend or your husband? 
13. Do you want to get well? 
14. Do you feel you can get better? 


These questions were spaced 15 seconds apart. If the subject did not complete 
the Bender Gestalt designs by the time the last question of the series was heard, the 
questions were again presented from the beginning. During the testing, the subjects 
were carefully observed at all times. The number of the question which coincided 
with the different Bender Gestalt designs was noted and recorded. 

After the subjects had completed the copying of the designs, accompanied by 
the added stimulation of affective questions, they were told to write down as many 
of the questions which they heard that they could remember. 

The Bender Gestalt designs were scored for deviations according to the Pascal- 
Suttell©) method. In addition, Design A was also scored in a manner similar to 
that prescribed for the remaining eight designs. The investigator found that this 
design could not be omitted from the scoring in view of its importance for testing 
of possible distortions or rotations. 


{ESULTS 
A comparison was made of the results of the experimental and control groups 
for the following categories: differences in the Bender Gestalt designs before and 
after the playing of the recording containing affective stimulation; differences in 
time reaction before and after affective stimulation; the number of questions re- 
called in each group, and the type of behavior during the testing. 
TABLE 1. CHANGES IN BENDER GEsTALT PERFORMANCE AS MEASURED BY THE 
PascaL-SuTTELL METHOD AFTER AFFECTIVE STIMULATION FOR EXPERIMENTAL 
AND ConTROL GROUPS 


Experimental Group Control Group 
T=15 N 


Improvement 
Same 


Worse 


The results in evaluating the deviations on the Bender Gestalt figures both 
before and after affective stimulation are shown in table 1. The Chi Square Method 
shows that the differences between the groups are significant at the one per cent 
level. It is interesting to note that those cases in the experimental group which 
showed no change also were considered to be clinically improved, and two of the 
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patients are considered sufficiently recovered to leave the hospital. The psychom« 
disorganization in the experimental group was at times very severe. Rotations, 
tortions, perseveration, inappropriate planning, and refusing to complete the t 
were present. It was possible for the investigator to note which of the affective 
questions was helping to produce this excessive disorganization and to better under- 
stand the sensitive areas. 

M . With respect to differences in time reaction in the copying of the Bender 
Gestalt designs before and after added affective stimulation, it was found that th 
control group had a record of 11 cases or 73.3 per cent where the time taken on thy 
second performance was either the same or less than the time taken to complete the 
copying on the first performance. This was markedly different from the result in 
the experimental group where only four cases or 26.7 per cent were able to complete 
the task in the same or less time than the initial performance. This again points to 
the presence of greater blocking in the experimental group under the pressure of 
affective stress. 


TaBLe 2. Recaut or AFFECTIVE StREss BY EXPERIMENTAL AND CONTROL 
GROUPS 


Number of 
Groups Questions Recalled 


Mean 


Experimental 


Control 





Differences in the recall of the questions asked during the added affective stim- 
ulation performance were noted. The control group had a range of four to nine 
questions recalled accurately with a mean of six, as compared to the experimental! 
group having a range of zero to ten questions recalled with a mean of 4.2. No dis- 
tortions were noted in the control group. In the experimental group one case showed 
extreme distortions in recall and two other cases reported that they couldn’t re- 
member any of the questions asked. It was of importance to note that in one of the 
cases in the experimental group where the psychomotor disorganization was mini- 
mal, disorganization was apparent in the recall process. This patient reported that 
she could not remember any of the questions at all. 

With respect to the behavior during the experiment, no unusual or bizarre 
was noted in the control group. In the experimental group six of the cases showed 
marked inappropriate behavior. Examples of this included: stopping in the middle 
of a task coincidental with the presented affective stimuli and displaying extreme 
inattention, responding to the dictaphone as if it were a real person, inappropriate 
grinning and smiling when affective stimulation was presented, disturbed looks ac- 
companied by screwing up of the forehead or excessive frowns, and responding to 
the questions overtly which was contrary to the directions. One patient who could 
no longer restrain herself said in a very loud voice during the affective stimulation 
“Yes, I’m sick, I’m sick.”’ This behavior was not apparent during the initial testing 
but became easily discernible in six of the cases of the experimental group after the 
affective stimulation was applied. The results are consistent with those obtained 
in the Malmo, Shagass, and Belanger Smith“ study. They tend to suggest that 
further studies along similar lines are needed. 

Although the technique used in the present experiment is subject to possibl: 
revision, the basic approach of employing simultaneous impingement of stimuli in 
clinical setting is worthy of careful consideration, and merits more extensive r 
search and the author would appreciate the forwarding of any additional data « 
results of investigations which are the same or similar in design. 
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SUMMARY 

Two groups, a control and experimental, copied the Bender Gestalt designs 
once and then a second time under the additional stimulation of affective stimuli in 
the form of personal questions. The control group consisted of 15 affiliate nurse 
students with a range of 19-24 years and a mean of 20 years. The experimental 
group consisted of 15 female psychiatric patients, all of whom had been diagnosed 
as psychotic, with an age range of 23-39 years and a mean of 30.7 years. Differ- 
ences in the performance between the two groups included psychomotor disorganiza- 
tion, delayed reaction time, some difficulty in recall of the affective stimuli, and be- 
havioral disorganization in the psychiatric group when compared with the control 
group. 

The suggested technique of impingement of simultaneous stimuli, one of which 
has affective meaning, is worthy of further investigation, particularly since in the 
clinical setiing it may give a clue to underlying dynamics, and serve to indicate 
how the individual actually reacts to a relatively mild stressful situation. 
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THE TIME-LIMIT VERSION OF THE WORD-ASSOCIATION TEST* 
J. P. S. ROBERTSON 


Netherne Hospital, Coulsdon, England 


INTRODUCTION 

A group form of the word-association test had wide use during the Second 
World War in the selection procedures of the British Army, as described by Harris ° 
and Vernon and Parry“. More recently the same form has been employed in 
Britain in the selection procedures of other institutions and has been introduced into 
some mental hospitals as a diagnostic method in abnormal psychology, as described 
by Robertson“. In this form of the test S writes his own responses. He is given a 
time-limit in which to do so but his RT to the stimulus-word is ignored. He is ex- 
pected to respond not by one word only but by a phrase, a sentence or a series of 
words. He is also expected to write as much as he can within the time-limit. 

This form of the test has come to be known as the group word-association test 
in contrast to the traditional individual word-association test from which it was 
derived. Its essential difference from preceding forms, however, lies not in its being 
a group test, since group forms of the word-association test have been employed 
before, but in fact that S is given a time-limit and expected to reply by a phrase or 
sentence, not by a single word. For this reason it can more distinctively be termed 
the time-limit, as opposed to the traditional single-word, version of the word-associa- 
tion test. 


COMPARISON OF SINGLE-WORD AND Time-Limit VERSIONS 
The circumstance that S writes as much as he can within the time-limit trans- 
forms the nature of the test. The collection of written responses gives much more 


*The author has incorporated some material included in a doctoral thesis accepted by the Uni- 
versity of London in 3950. 
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abundant evidence than-do single-word responses about the personality of S, b 
on his chosen modes of formulation and on the attitudes that are elicited from | 
This may be exemplified by quoting blocks of single-word and time-limii 
sponses to the same stimulus-words given by the same psychoneurotic 8. The ti 
limit responses were obtained five weeks after the single-word ones. 
Stimulus- One-word RT Time-Limit response 
word response 
TRUNK Travel 
DRINK Beer 
PARTY People 
BOOK Shelf 
LAMP Light 
RUG Fire 
CHAIR Sedan 
BOY-FRIEND _ Girl-friend 
DARK Horse 
DEPRESSED Feeling 
SPRING Flowers 
BOWL Roses 
SUIDICE Death 
MOUNTAIN Valley 
HOUSE Home 


The trunk murders before the war 
I don’t like beer. 

I’m shy at parties. 

I read a lot. 

Streets at night. 

The rugs at home. 

A comfortable easy chair. 

It seems an American word. 

I used to be afraid of the dark. 

I am often depressed. 

Spring is the best time of the year. 
Old men play bowls. My father 
I would never commit suicide. 

I am interested in geography. 
We have got a nice house. 
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It will be-seen that the time-limit responses present a clearer and fuller picture 
of S’s attitudes, outlook and modes of reaction. They are more specific, more un- 
ambiguous, less superficial and give more scope for the display of individual differ- 
ences in attitude. Their only disadvantage is in not providing the measure of dis- 
turbance given by RT in the single-word responses. The difference between the 
responses obtained by the two methods is not always so marked as in these examples 
but the trend is unmistakable. 


ADMINISTRATION OF TIME-LimiIt VERSION 


In order to obtain good results with the time-limit responses careful test in- 
structions are necessary, emphasizing that there are no correct answers and that 8 
can express himself in any way he likes, except for the following. 

Something must be written for every stimulus-word, leaving no blanks. 
As much as possible must be written within the time-limit and one-word 
responses must be avoided. 

It is not a vocabulary test, so that S is not being asked to write down what 
the words mean. 


It is helpful at tie beginning to quote examples of possible responses, e. g. “I might 
give the word DOU. You might write down My spaniel at home or I love them or 
I prefer cats or Fox-terriers. Bulldogs. Retrievers. Collies, or A dog running after a 
rabbit or dozens of other things.” 

The optimal time-limit is probably 20 seconds but with aged or seriously dis- 
ordered patients a minute may be necessary and with young, alert S’s 15 seconds are 
ample. It is essential to maintain the same time-limit throughout. A visual pre- 
sentation of the stimulus-words is preferable to an auditory. The test is most success- 
ful when administered to a group but can also be applied individually. It is very 
undesirable in the latter case for the tester to write the responses. 


Mopes or INprvipuAL DiFrFERENCE IN PERFORMANCE 
The chief lines along which individual differences appear in the time-limi' 
responses are the following. 
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The general verbal formulation of responses. Phrases, sentences, word chains, 
completions, ete. 


The particular verbal formulation of responses. Queries, exclamations, com- 

mands, etc., or the use of negatives, abstract nouns, passive verbs, words 

such as “‘must’’, “ought”’, “‘wish”’, “‘try’’, “‘I’’, “‘you”’, “we’’, ete. 

S’s general way of approaching the task set by the test. Factually, introspective- 

ly, valuatively, verbally or irrelevantly. Examples of these are: 

(factual) CHILDREN They spend a lot of time in school. 

(introspective) CHILDREN A feeling that I want to protect 
them. 

(valuative) FATHER A real good sort. One of the best. 

(verbal) FATHER Eternal Father strong to save. 

(crrelevant) FAILURE Bones in my arm. 


The attitudes and opinions elicited by particular stimulus-words. Elicited atti- 
tudes as opposed to formal characteristics depend largely on the word-list 
used. With mental-hospital patients it has been found useful to employ 
words covering the following fields: family relationships; social relation- 
ships; emotional states; sex and marriage; body parts; disease; death, fear 
of death and suicide; and miscellaneous trends, such as suspicion, anxiety, 
aggressiveness, low self-valuation, fussiness, etc. 

Special contents and references not elicited by particular stimulus-words. Allu- 
sive responses, quotations, current-event responses, personal-name re- 
sponses, place-name responses, pre-occupations, etc. 


Many individual differences along these five lines appear to be correlated with 
differences in personality and psychopathology. It is hoped at a future date to pub- 
lish evidence bearing on this from hundreds of cases tested at Netherne Hospital. 

The time-limit test is particularly successful in evoking the deviant modes of 
verbal expression and thinking characteristic of schizophrenic reactivity. Body 
words, e.g. BLOOD, STOMACH, etc., are useful in the body-schema aspect of this 
problem. The following are examples of responses showing the thinking disturbances 
commonly described as “‘schizophrenic”’ 


ENVY Shame with a good heart to it. 

ENVY She is the next sister to jealousy. 

BODY Body odor double fear. 

BODY Skeleton holding a machine. 

BLOOD Red and white corpuscles flow along the mind. 
BROTHER Perhaps my brother is butter, better than margarine. 
STOMACH The pro’s and con’s of peas and cream 
SUCK To take in by convulsions and electric fits. 
HUSBAND I believe this is always a male. 

ANGER My brain bursts out of my skull. 

MOUTH The groceries’ enemy. 


CoMPARISON WitrH SENTENCE-COMPLETION TESTS 

The time-limit word-association test has obviously many features in common 
with sentence-completion tests as described by Shor’, Rotter and Willerman“? and 
Bell). As contrasted with these, it is more effective in disclosing the basic modes of 
approach which are unwittingly expressed in verbal formulations and in bringing 
to light thinking disturbances. It is also more speedily administered to disordered 
persons. It is less successful in pinning the patient down to the expression of an atti- 
tude on a particular topic. The two tests, however, conveniently supplement one 
another when combined in a battery, the time-limit association test being given a 
group administration and the sentence completion test a more leisured individual 
administration subsequently. 





J. P. 8S. ROBERTSON 


SUMMARY 
A brief account is given of the history, nature and administration of the tin 
limit version of the word-association test and of the lines along which inferences ca 
be made from it as to personality and psychopathology. 
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EFFECT OF TEST-RETEST INTERVAL AND ITEM ARRANGEMENT ON 
THE SHORTENED FORMS OF THE MMPI 
GORDON L. MACDONALD 
Toledo State Hospital 


This study investigated two aspects of test administration connected with the 
MMPI. One was the effect of the time interval between test and retest, and the 
other the effect of item arrangement on the individual form of this inventory. It 


was hypothesized that either, or both, of these might have been responsible for the 

low statistical values obtained for validity and reliability in an earlier study“). 
PROCEDURE 

Groups. There were four groups used in this study: (1) A mixed population of col- 

lege students and student nurses consisting of 50 males and 67 females, (2) a sub- 


group of 24 student nurses from the latter population, (3) a group of 10 student 
nurses, and (4) the control group of 25 student nurses. 


Forms. The two forms of the MMPI were shortened by eliminating all of the un- 
scored items including the sixteen duplicated items on the group form. The second 
appearances of these items were masked out. The scorable items on the grcup form 
stop at number 365 with seven items past that number being scored for the K scale. 
The items on the individual form were arranged in identical order with those on the 
group form. They were rearranged in this same order after each scoring. 


Test Administration. The control group of 25 student nurses took both forms of 
the test at one session with the forms being alternated in presentation. The items 
of the individual form were rearranged in the same order as the group form after 
being scored. The mixed population of 117 and the sub-group of 24 student nurses 
had a one week interval between tests with the items on the individual form being 
randomized for each presentation. The group of 10 student nurses took beth forms 
of the test at one sitting with the individual form items being randomized. 
Statistical Analysis. Coefficients of correlation for test-retest were computed as were 
the critical ratios of the differences between the means of test-retest results for th 
control group of 25 student nurses. The comparisons of the groups were by the 
critical ratio technique. 

RESULTS 
Control Group. The coefficients of correlation and the critical ratios obtained f 
the test-retest comparison of the control group of 25 student nurses are given | 
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Table 1. The coefficients for the L, F and Mf scales for both raw scores and T-scores 
are of such magnitude that predictions of individual’s scores could be reliabiy under- 
taken. The critical ratios of the differences between the means is statistically in- 
significant for all scales and both scoring methods. 

Control Group Compared With Mixed Population. The critical ratios obtained for 
both forms and both scoring methods for the comparison of the control group with 
the mixed population of 117 individuals is shown in Table 2. The C. R. for the Mf 
raw scores, and the K scale T-scores (individual form) are significant at the 1°, level. 
The Hy scale T-seores (individual form) have a C. R. that is significant at the 5°% 
level. The Mf scale raw scores and the K and Mf scales T-scores (group form) have 
critical ratios that are significant at the 1% level. The K scale raw scores and the 
Hy and Sc scales T-scores show C. R.’s. significant at the 5% level. All of the other 
C. R.’s. for the comparisons of these groups are statistically insignificant. 

TaBLE 1, COEFFICIENTS OF CORRELATION AND CriTICAL RATIOS OF THE DIFFERENCES BETWEEN 

THE Means For Test-ReETEsT COMPARISON FOR TWENTY-FIVE STUDENT NURSES 








r r C.R. C.R. 
Raw Scores T-Scores Raw Scores T-Scores 


.846 : 141 .031 
.847 S78 1.388 .893 
.795 .677 .067 .172 
.508 .625 . 239 .630 
.844 57 .370 211 
.606 .60¢ .428 .428 
. 503 . 62: 678 .032 
. 894 89; 595 .629 
.722 «de . 528 522 
.819 6 694 .036 
Se . 847 .562 368 584 
Ma .519 oe .641 356 





TABLE 2. CriticaL Ratios OF THE DIFFERENCES BETWEEN THE MEANS COMPARING THE MIXED 
PopuLaTION WITH THE CoNTROL GrovUP. 


Group 
Form Form 


T-Scores 





Raw Scores T-Scores Raw Scores 





1.395 .421 1.250 .324 
.150 .276 1.105 .146 
.729 .387 2.901 5.068 

.369 .012 .942 
.167 1.162 251 
873 2.388 854 
.119 .424 945 
.408 3.637 .939 
.032 .891 . 869 
.791 476 420 
.679 .828 2.536 
1.899 .874 2.287 5.214 


Comparable Groups of Student Nurses. The critical ratios of the differences between 
the means for both forms and scoring methods derived from comparing the control 
group with a group of 24 student nurses is shown in Table 3. The C. R.’s. for all 
scales of both forms and both scoring methods are statistically insignificant. 


Time Between Tests Constant. The C. R.’s. of the differences between the means for 
both forms and scoring methods for the comparison of the control group with a 
group of ten student nurses is shown in Table 4. Both forms of the test were admin- 
istered at one session to all subjects in both groups. The C. R.’s. on all scales for 
both forms and both scoring methods are statistically insignificant. 
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TaBLe 3. CriticaL Ratios OF THE DIFFERENCES BETWEEN THE MEANS COMPARING THE Sus-Gro 
or 24 Srupent Nurses Wir THE ContTrROL Group 


Individual Group 
Form Form 


Raw Scores T-Scores Raw Scores T-Scores 


840 045 .173 

545 .770 1.708 

000 471 .523 

345 210 1.330 

919 125 1.743 

Hy 371 . 762 1.934 

Pd 214 776 384 

Mf 463 323 . 832 

Pa 1.229 336 .571 

Pt 1.807 5 2.373 

Se 1.520 : 1.834 k 
Ma 853 6 .145 1.654 


L 
F 
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Hs 
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TaBLe 4. Criticat Ratios OF THE DIFFERENCES BETWEEN THE MEANS COMPARING THE GROUP OF 
10 SrupEeNntT Nurses WITH THE ConTROL Group. 


Individual Group 

Form Form 

Raw Scores T-Scores Raw Scores T-Scores 
661 018 677 .329 
345 646 1.785 2.073 
034 646 1.263 .768 
.794 363 1.490 . 130 
171 322 2.002 1.942 
012 577 193 .219 
567 475 1.053 476 
.189 143 618 . 706 
594 504 1.786 1.853 
645 2.227 .948 
.484 1.662 456 
.598 648 240 


SUMMARY AND CONCLUSIONS 

The control group shows less discrepancies in performance from one form to the 
other, as indicated by the C. R.’s., than did the mixed population in the earlier 
study“. While the ¢oefficients of correlation have been raised in some’ cases (L, F, 
Mf scales) to the level where individual predictions could be attempted more re- 
liably, the general improvement in performance has not affected the other scales to 
the same extent. The results of this comparison of two forms composed of identical 
items arranged in identical order and administered at the same session leave much to 
be desired in a test to be used in appraising personality. 

The comparison of the control group with the other groups shows that the im 
provement in test performance of the control group can be considered as statistically 
insignificant. Those few statistically significant differences that did occur can prob- 
ably be attributed to differences in performance between the sexes found in th: 
earlier study“? of the mixed population. ; 

These studies show that item arrangement on the individual form and the time 
interval between test and retest did not affect performance significantly. The dis 
crepancies of the earlier study cannot be attributed to these factors. 
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THE CONSISTENCY OF LONGITUDINAL PERFORMANCE ON THE 
GRAHAM-KENDALL MEMORY-FOR-DESIGNS TEST 


RENATE GERBOTH ARMSTRONG 
Alton State Hospital, Alton, Illinois 


INTRODUCTION 


A pilot study on the Graham-Kendall Memory-for-Designs test was conducted 
to compare the difference between the performance of a group of schizophrenic and 
a group of organic patients. The purpose of the study is to determine whether the 
performance of schizophrenics varies significantly initially and longitudinally from 
organics. As far as could be determined the only published research on this test was 
done by its authors several years ago. The test consisted of 15 simple geometric de- 
signs, each of which is exposed to the subject for 5 seconds, after which he is asked 
to reproduce it from memory. Each design is scored on a four point scale ranging 
from 0 - 3. A score of 0 is given to a satisfactory reproduction; a score of 1 is given 
when more than 2 easily identifiable errors have been made but the gestalt is re- 
tained; a score of 2 is given when the gestalt has been lost; and a score of 3 is given 
when the design has been reversed or rotated by 90 or 180 degrees. 

PROCEDURE 

The subjects consisted of 25 organic and 25 schizophrenic Alton State Hospital 
patients, both groups comprising a rather heterogeneous sample. Since this project 
is only a preliminary study conducted in a limited period of time, no further variables 
were taken into account. Each subject was given the test 3 times in about a 10 day 
period. The materials were then scored and the statistical results evaluated. 


RESULTS AND DiscussiOoN 

The schizophrenic group had a mean score for the 3 performances of 7.33 
(8.D. 5.13) and the organic patients had a mean score of 14.87 (S.D. 7.29), i.e. the 
mean for the organics was more than twice as high as that of the schizophrenics. As 
to be expected then, a much greater amount of deterioration exists in the organic 
than in the schizophrenic patients. This difference was significant at the 1% level 
of confidence. Considerable overlap exists between the lower ranges of the schizo- 
phrenic group. Individual diagnosis, therefore, can only be completely acceptable 
at present in the upper one-third of the total test score range. With larger and more 
homogeneous sampling, finer distinctions may very well be drawn. 
TABLE 1. CoMPARISON OF PERFORMANCE OF * ORGANIC AND 25 ScuIzoOPHRENIC PATIENTS ON THREE 

RETESTS 











Mean Sigma Range 


Schizophrenic Group 33 5.13 0-23 
Test 1 .52 5.26 0-19 
2 32 6.43 0-23 
3 5.16 5.79 0-21 


Organic Group 14.87 .29 0-31 
Test 1 .00 83 0-31 

2 14.80 3.05 0-30 

3 13.80 9.73 0-30 

















t sig. 
Difference between Sch. and Org. Means 4. 01 
Difference between Sch. and Org. Mean Differences 0.88 .50 
_ _ "Now at East Moline State Hospital, Illinois. The author wishes to express her thanks to Melville 
Finkelstein, Sylvia Finkelstein, and Ruth Westerhold for their help and suggestions. 
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The statistical results show that within the organic group, Test I is 89% 
variable as Test II, but only 69°% as: variable as Test IIl. In the schizophrenic 
group, the percent of variability is much less: Test I is 70°% as variable as Test II 
and 65°; as variable as Test III. The organics showed a steady improvement from 
the first performance to the last. The schizophrenics, on the other hand, gave a 
much better initial performance but then failed to improve noticeably upon retesting, 

The differences between the schizophrenics’ M,;—Mehas a ¢ of 0.71, M,—Ms has 
a t of 1.48, and M.~M; has a ¢ of 0.66; the difference between the organics’ M,—M, 
has a t of 0.52, M,-Ms; has at of 0.86, and Me—Ms; has at of 0.39. The difference be- 
tween the three means of each group are not statistically significant. However, the 
schizophrenic group shows more inter-test variability than the organic as seen by 
the average difference between the individual performance; that of the schizo- 
phrenics is 2.23 (S8.D. 1.82), and that of the organics only 1.86 (S.D. 0.97). Con- 
sidering the great differences between the means of the two groups, one would expect 
the opposite to be in effect. 

The standard deviations of the separate means and the combined means of the 
two groups suggest that the performance of the schizophrenic is by far more erratic 
than of the organic group. The standard deviations are relatively close, while the 
range and means vary widely. 


SUMMARY AND CONCLUSION 


After testing two groups of twenty-five patients three times with the Graham- 
Kendall Memory-for-Designs Test it was found that the performance of the organic 
group is, in general, much poorer than that of the schizophrenic group, indicating a 
more extensive deteriorative process. The overlap between the higher range of the 
more deteriorated schizophrenics and the lower range of the less deteriorated organ- 
ics makes individual diagnosis tenuous. However, the retest may clarify this overlap 
by the amount of variability revealed. 

Although we found consistent trends as far as variability of performance is con- 
cerned, we cannot make a conclusive statement as to their cause. For example, the 
apparent improvement upon retesting the organics and their lesser intertest variabil- 
ity could be attributed to several causes. One could assume that these factors may 
be due to their less erratic behavior as compared to the schizophrenics, or to their 
greater motivation to improve their performance, and /or due to learning effect. On 
the other hand, there is a Jack of improvement upon retesting in the schizophrenics 
and a greater intertest variability. With larger and more homogeneous groups further 
validations may well be obtained, and the underlying reasons and causes for such 
performance may then be fully exposed and substantiated. It is then that the Gra- 
ham-Kendall will assume greater value and reliability as a diagnostic tool. 
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SUGGESTIONS FOR EFFICIENT PRESENTATION OF THE WECHSLER- 
BELLEVUE OBJECT-ASSEMBLY SUB-TEST 


WALTER SHANNON AND PHILIP D. ROSSI 


St. Lawrence University 


As every psychologist knows, all of the Wechsler-Bellevue test materials are 
subject to ease of presentation with the exception of the object-assembly sub-test. 
The traditional presentation of the object-assembly mutilations has always been 
awkward, time-consuming, and tending to disrupt rapport. However, by inserting 
the individual pieces in a specially constructed formboard as illustrated and des- 
cribed in this article, the psychologist will have a quick and efficient means of pre- 
sentation. A detailed description of how the formboards may be constructed is 
offered although a study of the photographs in most instances should suffice to 
demonstrate how construction can be effected. 


Figure 1 illustrates how the pieces fit in the formboard already in their proper 
places. Figure 2 illustrates how the pieces will appear to the subject once they are 
removed, and also illustrates the appearance of the formboard with pieces removed. 
Figure 3 illustrates how the panel is removed te allow speedy and accurate array- 
ment of the mutilations, Note that in this fashion the subject does not see the 
mutilations until given the instructions to put them together, and that all of the 
pieces are presented accurately with one manipulation whereas in the traditional 


Figure 1 
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Figure 2 


method each piece had to be handled separately. Only photographs of the manikin 
are offered because the authors felt that illustration of one of the sets would suffice 
to demonstrate the project. 


CONSTRUCTION OF THE OBsECT-ASSEMBLY FORMBOARD 


Materials Used. (1) Plywood: Length 14”, Width 914”, Thickness 4”. 
Sheet Steel: Thickness 1 /32’’. 
Three Pieces, Hard Sheet Steel, 154%” x 9 5/16” 


Three Pieces, Sheet Steel Stripping, 37 x 1144” 


Method of ( utting Figure Outlines in Plywood. Lay out the pieces fuce down tn re- 
verse order on the plywood and trace in this fashion. This applies to all three form- 
boards (see Figure 1). This is necessary because the formboard panel which covers 
the pieces slides from left to right arid thereby necessitates a reversal of the pesiti 
of the board in presentation (see Figures 2 and 3). 


Cut tracings 44” deep and all 
a tolerance of at least 1/8’ 


Sand down edges and rough spots. 

Method of Cutting Hard Sheei Steel. In cutting hard sheet steel piece to make th: 
panel, allow 13 13/16” for the cover and 1 15/16” for the handle. Cut in two inch 
on each side at 13 13/16” length to make the handle (see Figures 1 and 2). ° 
stripping is used on the sides of the plywood to form inserts for the panel sheet 
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Figure 3 


SUMMARY 


In using the formboard presentation of the object-assembly mutilations of the 
Wechsler-Bellevue Intelligence Scale as described in this article, the authors have 
found that use of this procedure saves time, eliminates the traditional awkwardness 
of presentation, insures proper arrangement of the mutilations, and helps to main- 
tain continued rapport in the testing situation. 





THE ANCESTRY OF ID 
PHILIP L. HARRIMAN 


Department of Psychology, Bucknell University 
Lewisburg, Pennsylvania 


Joan Riviere’s ‘translation into English of Freud’s The Ego and the Id reflects 
the wide connotation of the term id. Nevertheless, there is a strong likelihood that 
an error is being perpetuated with regard to the derivation of this concept. Custom- 
arily, the word 7d is said to have been derived from the Latin demonstrative pronoun. 
Writers who have the background of a classical education point out that the word 
is the nominative case, singular number, neuter gender of the familiar paradigm 
ea, td. In the writings of Cicero id denotes “‘this, that, that thing, that which was 
previously mentioned.”’ By present-day convention in psychoanalytic writings, id is 
construed to mean the unorganized, inchoate, undisciplined impulses which obey 
nothing but the laws of elemental cyrenaicism or crass hedonism. The neuter impli- 
cation is often taken to be appropriate for designating the lack of personal responsi- 
bility inherent in the Jd. This aspect of the connotation seems to be a matter of 
general agreement among writers. If that were the only important signification of 
the term, 7d might be regarded as having been derived from the Latin demonstrative 
pronoun. No question would be raised. There would be, indeed, an etymological 
consistency between Jd and Ego. Id, however, connotes a static entity in classical 
Latin. Perhaps Groddeck, who suggested the term to Freud, meant to imply a 
dynamic foree. It is very likely that Freud readily accepted the term because zd has 
a dynamic signification attributable to some other derivation than’ the Latin de- 
monstrative pronoun, neuter gender, singular number. 

As a matter of fact, the term id came into general use among students of biology 
long before Freud commenced his medical studies. Inevitably, he must have heard 
the term frequently mentioned in university lectures dealing with the Weismannian 
revision of the ‘““Lamarckian error.’’ During the years 1868-1876, August Weisman: 
published a notable series of papers dealing with the mechanisms of heredity and 
with other major issues in biological theory. His sight having been impaired, he was 
forced to give up microscopical work and to devote himself to theoretical formula- 
tions. His influence was great throughout the university world, and he was generally) 
regarded as the greatest authority on the subject of biology. Naturally, therefore, 
Groddeck and Freud heard much about the Weismannian concepts during thi 
student days. It is likely that Freud gave ready acceptance to the use of 7d as an 
appropriate concept in psychoanalytic theory because he found it to connote a dyna- 
mic concept. If this conjecture have any validity, he did not intend to imply that h« 
took the word from the familiar Latin sources. On the contrary, id is used by Weis- 
mann to denote something like the gene in contemporary biological writings. 

The student will recall that Weismann postulated the existence of biophor 
as the ultimate supramolecular vital unit. They, in turn, are orgariized into dete 
minants. Both biophores and determinants are submicroscopic in size. Both of them 
exert dynamic influences upon the course and the nature of development. Th: 
determinants are clustered in the visible granules of the chromatin, »nd these hi 
termed ids. The chromosomes he considered to be made up of still larger units termed 
idants. Both ids and idants are to be observed through the microscope. Hence, the) 
are not hypothetical constructs but observable entities. The id, therefore, is th 
smallest observable determinant of those biological processes subsumed under t! 
terms growth and development. The effect of ids, of course, is dynamie and inte! 
active. Ids, in a sense, represent tendencies for development; and out of them aris 
all the differentiated structures of the mature organism. According to Weismann, thi 
growth process is the very antithesis of a static procedure that would lend itself | 
symbolization by a demonstrative pronoun in the neuter gender. Quite to the co! 
trary, the Weismannian concept of ids may be logically extended into psychologica 
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ory of the dynamic type. The basic point of view in Weismann is logically con- 
sistent with the dynamic principle of entelechy in the writings of Aristotle. It is very 
sible to believe that Freud was familiar with these concepts about the dynamic 
ture of hereditary determination of organisms. In fact, it is doubtful whether any 
European university student of biology could have failed to master Weismann’s 
theories and to acquire his terminology. They constituted a forceful negation to all 
views previously upheld in medical schools. Particularly were they readily accessible 
students in German-speaking countries. Freud could not have remained un- 
influenced by this theory of biological determinants in growth and development. 
He must have read and heard much about ids during the early years while he was a 
medical student and a research investigator in neurology. 
The derivation of id from Weismannian theory does appear to be in close accord 
ith its connotation in psychoanalytic theory. Were it to be referred to classical 
Latin, Freud might have justified his choice of the term. As a matter of fact, he 
tacitly assumed that id does have an appropriate dynamic connotation which is 
iseful both in accounting for biological and for psychological development. Of 
ourse, he invested the term with much broader significations than it ever had in 
iological theory. Perhaps his choice of id was a logical, meaningful! extension of a 
ord he had often encountered in earlier years. ( ertainly, both in Weismann’s writ- 
ngs and in psy hoanalytic theory id has a dy namic connotation. The Latin demon- 
strative pronoun, on the other hand, has a rather meaningless, neutral, static value 


I used to symbolize this Freudian reterrent. 





NOTE ON A HORNEY HYPOTHESIS 
WILLIAM SEEMAN 


Section of Neurology and Psychiatry, 
Mayo Clinic, Rochester, Minnesota 


INTRODUCTION 


The problem with which this paper is concerned is Horney’s hypothesis which 
states that “the basis of anxiety usually lies . . . in hostile impulses...” ©. P- ®) Por 
our purposes this may be stated in correlation language as a hypothesis asserting a 
significant association between hostility and anxiety. That Horney views her con- 
ceptual analysis of anxiety as being different from Freud’s analysis is suggested by 
her statement that “. . . the frequency with which anxiety is generated by sexual im- 
pulses is largely dependent on the existing cultural attitude towards sexuality. I do 
not see that sexuality as such is a specific source of anxiety ... A third point in which 
I find myself at variance with Freud is his assumption that anxiety is generated only 
in childhood ... and that anxiety occurring later in life is based on reactions which 
have remained infantile”. © Pp». 78-77) 

Although Horney nowhere presents evidence relevant to the validity of this 
formulation, it is an intriguing and plausible hypothesis. The present note is pri- 
marily intended to illustrate one way in which the theoretical hypothesis can be given 
empirical meaning in terms of a logically entailed deduction. If it were shown that a 
specific daydream could be established in a statistically reliable fashion as a ‘‘hostile” 
daydream, then, according to Horney’s formulation, there should be a significantly 
covarying anxiety. 


METHOD AND RESULTS 
Five daydreams known to be widely experienced “: * were selected. These were 
submitted to six judges (two psychoanalysts, two psychiatrists and two clinical psy- 
chologists) who were asked to select the daydream regarded as most hostile. The 
daydreams are as follows: 

Type 1: In this daydream you visualize yourself as performing or having 
performed some daring or heroic act or feat, such as rescuing a person from 
danger or certain death, or facing some tremendous danger yourself and over- 
coming it. As a result of this heroic act you are publicly admired and acclaimed, 
and your friends heap admiration on you. 

Type 2: In this daydream you visualize yourself as being extremely at- 
tractive. You are complimented on your appearance and your clothes. At 
gatherings and social functions you are the center of attraction, receiving at- 
tentions from many persons present, especially members of the opposite sex. 


Type 8: In this daydream you visualize yourself as being promoted in 
your job to a position of greater importance as well as greater remuneration. 
You may visualize yourself as directing activities, giving orders, and so forth 
You may see yourself at a banquet in your honor with officials making speeches 
about your abilities. 


Type 4: In this daydream you visualize yourself as greatly wronged or 
misunderstood by friends or parents or both. You visualize yourself as meeting 
some great misfortune, perhaps even death. When your friends or parents dis- 
cover that this great misfortune has occurred, they are sorry, even miserable. 

Type 5: In this daydream you visualize yourself talking back to, o! 
winning an argument against a person to whom in real life you would like to 
talk back or with whom you would like to argue, but somehow feel that you 
never can or dare. 
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Horney’s hypothesis lays down the following theoretical requirement: 
daydream which is most heavily loaded with hostility (by the criterion of statisti 
reliable agreement of ratings by trained and competent individuals) should « 
anxiety with a significantly greater frequency than do the other daydreams 

with regard to hostility turned out to be unanimously in agreement, w] 
the reliability of the criterion well beyond the 0.01 level of confidence. 1 
elevant data reporting anxiety associated with each of the daydream types are 
ented in table 1. Three samples of college students were asked to rate the degre 
anxiety assoc iated with each of the daydream types; no anxiety was indicated | 
ating of ‘‘O’’, some degree of anxiety by a rating of ‘‘1,” and considerable anxi 
rating of ‘‘2.”’ For purposes of economy the ‘‘1”’ and ‘‘2”’ ra are combi 
the table under the columns marked “Anxiety.” The differences between an 
associated with daydream 4, that rated unanimously as ‘‘most hostil 
s, and the anxiety ratings associated with the other daydream types are 
be ve nd the 0.01 level; there are twelve such statistically reliable difi 
four for each sample. These are presented in table 2. 


DIFFERENCES IN DeGrEeE oF Anxiery Assocratep Wirn “Hostite’” Daypream 
crery AssocriaTeD Wira OTHER DAaYpDREAMS IN THREE SAMPLES OF COLLEGE STUDENTS 


Sample 


Minnesota 1 
Minne sota 2 
Ohio State ) 29.9 


*Differences of the percentages given in table 1; all are significant beyond the 0.01 level 


SUMMARY 


A Horney hypothesis states that hostility and anxiety will be found to covary 
significantly. This paper has presented evidence to the effect that anxiety asso- 
ciated with the one of five daydreams rated as ‘‘most hostile’ (with agreement of six 
judges beyond the 0.01 level) is reliably greater than chance in three samples of col- 
lege students. These differences are in the direction theoretically required by th 
hypothesis and may thus be held to be relevant to its validity. 
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EDITORIAL OPINION 





REFLECTIONS ON EDITORIAL POLICIES 


With this issue marking the end of eight years of publication of this Journal, 
seems desirable to review the general objectives and policies which have controlled 
w actions. This attempt at self-evaluation is stimulated partially by several 
valuative reviews of the Journal’s contents from external observers but more 

particularly by our own editorial awareness of our responsibilities for periodically 
aluating and publicizing our policies. Every editor has large responsibilities in 
his role of steward of a publication enterprise serving the interests of science and 
ublic welfare. It is almost impossible to administer publication policies in a com- 
pletely neutral, unbiassed and objective manner, since with such a large supply of 
manuscripts offered for publication, an important act of decision must made | 
the acceptance or rejection of every single paper. As experiences with publi 
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services. Was this growing-up period of clinical psychology under conditions o 
great stress to be accomplished by trial-and-error behavior evolving slowly over a 
long period of time, or was it possible to telescope the process by the intensive appli 
cation of insightful learning derived from study of the development of older sciences 
which had already outlived their growing pains? Was it not possible to save many 
years of experimentation and countless errors by learning from the experience of 
others? It was within this framework of events that we reached an individual deci- 
sion to launch a new independent Journal which would serve as a loyal opposition 
and ‘“‘needler” to the necessarily more conservative official organizations and publi- 
cations. 

The launching of the project in 1945 under wartime condition with private 
financing was accomplished only under great and unforeseen difficulties. The great- 
est difficulty was in securing sufficient papers to publish the first issue (and even the 
first three volumes) on schedule. For example, with the first issue and later on sev- 
eral occasions, we had sufficient papers promised or scheduled to get the issue out, 
only to have insufficient material arrive in publishable condition. In spité of the 
contention voiced by some groups that insufficient publication outlets exist in clinical 
psychology, the fact seems to be that ample facilities exist to publish all the good 
papers (and also a large quota of not so good papers) providing that they are edited 
so as to avoid padding and undue theorizing. In any case, your editor received dur- 
ing the fir ’t years of publication a large number of second-rate papers many of 
which were tattered and torn dust-stained manuscripts which had withstood the 
creasings of many previous editorial rejections but which were still being submitted 
by parents who hoped that sometime somewhere somebody might accept them for 
publication. Under these conditions (and also because we believed that many things 
needed to be brought out in print), your editor arranged several symposia and mono- 
graphs and also included a series of papers on directive psychotherapy and other 
topics which seemed important to us. Although we have recognized the validity of the 
advice once given us by the late Willard Valentine not to publish our own material, 
many of the early issues would never have appeared on schedule except by addition 
of such material to make up the duly advertised number of pages. Fortunately, this 
condition has remedied itself in the last three years after a reliable flow of good manu- 
scripts finally established itself. Operating under such conditions, it was inevitable 
that some sacrificing of editorial standards and principles was necessary simply to 
accomplish the practical matter of getting the Journal out so that the good papers 
would be published and our obligations to subscribers and others satisfied. That 
this was accomplished to a satisfactory degree is partially attested to by steadily 
increasing influence and subscription lists. 

A second and continuing source of difficulty has been the problem of securing 
editorial board parti¢ipation in an active shaping of policy and editorial contribu- 
tion. While the editing of a scientific journal should theoretically be a group en- 
deavor, it is in practice very difficult to secure more active participation from board 
members than occasionally submitting or evaluating manuscripts. The editorial 
pages of this Journal have always been open to any board member (and also on 
occasion to unsolicited contributors) but only on a few occasions have any of them 
felt inspired to express themselves. The board has made one important econtribu- 
tion, however, in evaluating and criticizing all editorial material before it is pub- 
lished, an agreement having been made that nothing will be published except with 
the unanimous approval as to general acceptability by the entire board. Having 
adopted an active editorial policy, we would like nothing better than to run an open 
forum of editorial opinion, but failing that, there is nothing left but to continue the 
department alone or to abandon it completely. The latter step has been considered 
on several occasions, but the steady flow of commendatory letters from readers has 
persuaded us to continue it. 

A third problem has involved the evolution of editorial standards both for the 
acceptance of manuscripts and for their editing according to the style of the Journal. 
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Selection problems are influenced by our opinion that there is currently only a very 
limited amount of high-grade research being performed in clinical psychology. Al- 
though the calibre and quantity of research is slowly improving, only a small num- 
ber of really high grade manuscripts are submitted. Although the official Journals 
have slightly better material to choose from, most editors receive a few outstanding 
manuscripts in each quarter and must fill out their issues with secondary material 
of more limited interest and research significance. The best an editor can do there- 
fore is to publish a representative cross-section of what he is currently receiving 
while at the same time stimulating and soliciting the type of manuscripts which he 
would really like to receive. We have made special efforts to keep this Journal stim- 
ulating and readable not only by manuscript selection policies but also by requiring 
authors to (a) condense material to a brief statement of method and results, (b) to 
simplify textual, tabular and graphic presentations of data by avoiding undue dupli- 
cation, (c) by eliminating over-extensive statistical data due to the fact that we 
assume that contributors are sophisticated enough to make their calculations cor- 
rectly, (d) by rigorously deleting imaginative theorizing, and (e) by attempting to 
be genuinely eclectic and thereby avoiding undue adherence to any school of thought. 
We have insisted on brief concise research reports because we believe that our read- 
ers are interested in a wide coverage of all important trends in the field rather than 
in long detailed reports from a few authors. In our opinion, there is very little re- 
search being done in the field which cannot be fairly encompassed in four or five 
printed pages. 

A statistical analysis of evaluative correspondence received from our readers 
since 1945 would result in a bimodal distribution of opinions skewed more strongly 
on the side of those who commend our efforts. As would be expected, our editorial 
opinions have stimulated the most numerous reactions since many of them have been 
of admittedly controversial nature. We have a vociferous group of loyal adherents 
who appear to support our ideas wholeheartedly, and on the other hand we have 
numerous critics some of whom are so emotionally involved and personally antagon- 
istic as to discontinue their subscriptions and express book-buruer sentiments. We 
regret the latter attitudes since it also reflects on our innocent contributors who 
have had nothing to do with the controversial editorial opinions. Such a situation 
could only have been anticipated in relation to an editorial policy which has dealt 
with as many controversial topics as we have. In partial extenuation, we wish to 
assure our readers that our motives have been the best even though at times our 
judgment may have erred. In our own professional development, we have been 
perennially embarrassed over the falibility of attitudes which we held so true only 
last year. We are not satisfied with ourself or with the state of perfection of the 
Journal. But we continue to derive some solace over having been the means of bring- 
ing many ideas to the attention of the profession, and we will be satisfied if only 
fifty-one per cent of them bear some ultimate fruit. 
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Sy mposium. The Bioloay of Me ntal Health a? d Disease. New York: Paul B. Hoeb: 
1952. Pp. 684. $10.00. 


These are the proceedings of the 27th Annual Conference of the Milbank Mer 
orjal Fund which was held in November 1950 at the New York Academy of M: 
cine. Thirty-eight principal papers are included with commentaries by 108 \ 
tributors representing all of the basic and applied fields in psychologic al science. A 
of the papers in this volume are so authoritative and up-to-date that it will serv: 
an important reference work for advanced students for some years to come. 
Fercuson, LEONARD W. Personality Measurement. New York: McGraw-Hill, 1952 

Pp. 457. $6.00. 

The author is associated with the Aetna Life Insurance and affiliated Com- 
panies. This is an authoritative textbook based solidly on the experimental-statisti 
cal approach to personality measurement. Its organization is markedly different 
from older books in the field in that it places greater emphasis on measures of inter- 
ests and attitudes, adjustment and performance rather than upon psychometrics 


Artuur, Grace. The Arthur Adaptation of the Leiter International Performanc 

Scale. Washington: Psychological Service Center Press, 1952. Pp. 73. 

This is a manual outlining the rationale and directions for administering thx 
Arthur adaptation of the Leiter Scale. Detailed scoring instructions with illustra- 
tions are given for each age level. 

Woxrr, Cuartotre. The Hand in Psychological Diagnosis. New York: Philosophi- 
cal Library, 1952. Pp. 218. $7.50. 

Dr. Wolff is a fellow of the British Psychological Society who has been making 
comparative morphological and psychological studies of the human hand for mor 
than 15 years. This book presents a large amount of statistical data which she has 
collected concerning variations in the structure of the hand occurring in various con- 
stitutional types, in normal persons, in mental deficiency and in mental illness. In- 
cluded are measurements and observations derived from general inspection, measure 
ment of the hands in relation to height and weight, endocrine factors, examinatior 
of fingers and nails, and inspection of the creases of the palm. Although the fields of 
palmistry and chirognomy are currently in scientific disrepute, this book represents 
a serious scientific attempt to collect and correlate some long-known observations 
concerning the hand in such conditions as mongolism. 

Ketiy, E. Lowert and Fiske, Donatp W. The Prediction of Performance in Clin- 

ical Psychology. Ann Arbor: University of Michigan Press, 1951. Pp. 310. 

This is the report of a five year research program sponsored by the Veterans 
Administration and directed at the evaluation of techniques for the selection of pro- 
fessional personnel in the field of clinical psychology. The scope of the research was 
greater than any previously attempted and resulted in some very interesting find- 
ings comparing clinical and objective approaches to prediction. Every clinical 
psychologist should study the methods and conclusions reached in this report. 
JeRsHILD, ArTHUR T. In Search of Self. New York: Bureau of Publications, Teach- 

ers College, 1952. Pp. 147. $2.75. 

This little volume by a professor of education at Teachers College is directed 
toward the teaching profession in encouraging them to explore the role of the schoo! 
in promoting self-understanding. It outlines how the teacher may go about ele- 


mentary counseling activities in studying the child in order to help him understand 
himself. 
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SuosrroM, Everett L. and Brammer, LAWRENCE M. The Dynamics of the Counsel- 
ing Process. New York: McGraw-Hill, 1952. Pp. 213. $3.50. 

Dr. Shostrom is head of the Psychology Department at George Pepperdine 
College, and Dr. Brammer is Guidance Officer and assistant professor of psychology 
at Sacramento State College. This book outlines an eclectic approach called self- 
adjustive counseling which seeks to aid the client in becoming more self-directive and 
self-responsible. To assist the client to alter his self-concepts in more mature healthy 
directions, the authors propose a combination of nondirective and directive methods. 
This book is oriented primarily to counseling under educational auspices, ane most 
of the techniques are illustrated with materials from high school or college level 
counseling. The primary emphasis is on rather superficial level educ stional, voca- 
tional and educational problems with little or no attempt to deal with depth prob- 
lems. 


Bycuowski, Gustav and Dersprert, J. Lovrse (Eds). Specialized Techniques 

Psychothe rapy. New York: Basie Books, 195! . dtl. $5.00. 

The Psychiatric Forum Group consists of an informal monthly gathering of 
psychiatrists and workers in related field meeting in New York City to interchange 
the latest approaches in the field. Seventeen distinguished authors have contributed 
papers on specialized techniques in psychotherapy, including Hocu and PoLaTIn 
on Narcodiagnosis and Narcotherapy, LINDER on Hypnoanalysis as a psychothera- 
peutic technique, E1senBup on the Telepathy Hypothesis in Psychotherapy, 
Max M. Stern on Free Painting as an Auxiliary Technique, Spornirz on group 
therapy, MITTELMANN on Simultaneous Treatment of Both Parents and Child, 
Beata Rank on the psychoanalysis of young children, Despert on child schizo- 
phrenia, FROoMM-REICHMANN on the psychoanalytic treatment ol schizophrenia, 
BycHowskI on psychotherapy with the crippled and disabled, I. Perer GLAUBER 
on dynamic therapy for the stutterer, RurH Fox on psychotherapeutics of aleoho- 
holism, Davin ABRAHAMSEN on the treatment of sex offenders, MELITA SPERLING 
on treatment in psychosomatic medicine, Vicror W. ErseNsTerN on the differential 
psychotherapy of borderline states, BeLLak on the emergency treatment of de- 
pressions, and Prorrowsk!i and SCHREIBER on Rorschach changes during analytic 
therapy. 

AvusuBEL, Davin P. Ego Developme nt and the Personali ily Disorders. New York: 

Grune and Stratton, 1952, Pp. 564. $10.00. 

The author is associated with the Bureau of Research and Service of the College 
of Education, University of Illinois. The book consists of a series of essays present- 
ing the historical development of ego psychology, its dynamics, crises and failures 
and failures of ego development,: the contribution of parent-child relationships to 
ego development, the relation of anxiety, the development of conscience, and a de- 
velopmental approach to psychopathology. The concept of satellization is introduced 
to describe a process of reorganization of ego structure whereby the child identifies 
with dominant parental figures in order to compromise between the two unfavorable 
alternatives of maintaining ideas of omnipotence or of grossly devaluating the self 
to meet the demands of reality. Various consequences of healthy satellization are 
discussed on theoretical levels, together with examples of failure of satellization. 


Ames, Louise B.; LEARNED, JANET; MerrRaux, Ruta W.; and WALKER, RIcHARD 
N. Child Rorschach Responses. New York: Paul B. Hoeber, 1952. Pp. 310. 
$7.50. 

This important book from the Gesell Institute of Child Development presents a 
comprehensive longitudinal study of Rorschach responses from ages two to ten years 
based on findings from a group of 650 children. Part one involving the first third of 
the book presents a review of the literature, outline of research methods and scoring, 
and discussion of the various determinants. The latter two thirds are given over to 
detailed presentations of typical patterns and illustrative cases at each age level. 
This source book is a most important milestone in the objectification of Rorschach 
responses. 
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ALEXANDER, FRANZ and Ross, Heusen, (Eds.). Dynamic Psychiatry. Chicago: Uni- 
versity of Chicago Press, 1952. Pp. 578. $10.00. 

Fifteen leaders, including 13 psychiatrists, 1 clinical psychologist and 1 anthro- 
pologist, have contributed papers on various theoretical and clinical aspects of dy- 
namic psychiatry. This book continues the current tradition which identifies ‘“dy- 
namic” thinking in psychiatry with psychoanalysis. Within this framework, this 
book is an authoritative summary of current opinion among analytically oriented 
workers. 

BycuowskI, Gustav. Psychotherapy of Psychosis. New York: Grune & Stratton, 
1952. Pp. 328. $5.75. 

The author is assistant clinical professor of Psychiatry at the New York Uni- 
versity College of Medicine. It consists of 34 chapters discussing his experiences in 
dealing with various types of psychotic reaction. Its most important contribution is 
the demonstration that many psychoses involve personality reactions which can b¢ 
dealt with by psychotherapy. 


Gortow, Leon; Hocu, Erasmus, L.; and Tetscnow, Eart F. The Nature of Non- 
directive Group Therapy. New York: Bureau of Publications, Teachers College, 
1952. Pp. 143. $3.25. 

This monograph presents a research evaluation of nondirective group therapy 
as applied to a group of 17 graduate students at Teachers College who requested help 
with personal problems. In spite of the smallness of the sample, the results of the 
project are significant because of the thoroughness of the statistical analysis. Special 
emphasis was given to the nature of the group process, the analysis of the behavior 
of the members as therapists, and of the role of the leader. 


Kris, Ernsv: Psychoanalytic Explorations in Art. New York: International Uui- 
versities Press, 1952. Pp. 358. $7.50. 

Dr. Kriss is associated with the Child Study Center of the Yale University 
School of Medicine. In this book he presents the results of more than 25 years of 
study in the field of the psychology of art in the form of a series of stimulating 
essays. Part one deals with the history of art with special emphasis on the potential 
contributions of psychoanalysis. Part two deals with the art of the insane with de- 
tailed consideration of the cases of a psychotic sculptor of the 18th century and of 
a schizophrenic artist. Part three presents some observations on the psychology of 
comic art and caricature, with comments on laughter as an expressive process. Part 
four discusses problems of literary criticism, and a concluding section of two chapters 
offers reflections on the psychology of creative processes. There is an appendix with 
79 illustrations of artistic productions which are commented upon in the text. 
Froscu, JoHN (Ed.) The Annual Survey of Psychoanalysis. Vol. I. New York: 

International Universities Press, 1952. Pp. 556. $10.00. 

This is the first of a proposed series of annual surveys of psychoanalysis con- 
tributed by leading authorities in the field who present detailed discussions of the 
most significant papers and books which appeared during 1950. 

Wo.rr, Werner. The Dream—Mirror of Conscience. New York: Grune and Strat- 
ton, 1952, Pp. 348. $8.50. 

Dr. Wolff is professor of psychology at Bard College and a prolific writer in the 
field of abnormal psychology. In this book he reviews the historical development of 
dream psychology, summarizes the most significant theoretical viewpoints with 
special emphasis on the contributions of psychoanalysis, and presents an interesting 
series of illustrations of typical dream patterns. 

VinackE, W. Epgar. The Psychology of Thinking. New York: McGraw-Hill, 1952. 

Pp. 392. $5.50. 

The author is assistant professor of psychology at the University of Hawaii. 
The book is an experimentally oriented, systematic study of the psychology of think- 
ing covering the historical and theoretical background of the field, reasoning, imagin- 
ation and personalized thinking. 
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